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Preface 


This is an introductory textbook dealing with the design and analysis of experiments. It 
is based on college-level courses in design of experiments that I have taught for over 25 
years at Arizona State University, the University of Washington, and the Georgia Institute 
of Technology. It also reflects the methods that I have found useful in my own profes¬ 
sional practice as an engineering and statistical consultant in the general areas of product 
and process design, process improvement, and quality engineering. 

The book is intended for students who have completed a first course in statistical 
methods. This background course should include at least some techniques of descriptive 
statistics, the normal distribution, and an introduction to basic concepts of confidence 
intervals and hypothesis testing for means and variances. Chapters 10 and 11 require 
some familiarity with matrix algebra. 

Because the prerequisites are relatively modest, this book can be used in a second 
course on statistics focusing on statistical design of experiments for undergraduate stu¬ 
dents in engineering, the physical and chemical sciences, mathematics, and other fields 
of science. For many years I have taught a course from the book at the first-year graduate 
level in engineering. Students in this course come from all the traditional fields of en¬ 
gineering, physics, chemistry, mathematics, operations research, and statistics. I have 
also used this book as the basis of an industrial short course on design of experiments 
for practicing technical professionals with a wide variety of backgrounds. There are 
numerous examples illustrating all of the design and analysis techniques. These examples 
are based on real-world applications of experimental design and are drawn from many 
different fields of engineering and the sciences. This adds a strong applications flavor to 
an academic course for engineers and scientists and makes the book useful as a reference 
tool for experimenters in a variety of disciplines. 


ABOUT THE BOOK 

The fifth edition is a major revision of the book. I have tried to maintain the balance 
between design and analysis topics of previous editions; however, there are many new 
topics and examples, and I have reorganized much of the material. There is much more 
emphasis on the computer in this edition. During the last few years a number of excellent 
software products to assist experimenters in both the design and analysis phases of this 
subject have appeared. I have included output from two of these products, Minitab and 
Design-Expert, at many points in the text. Minitab is a widely available general-purpose 
statistical software package that has good data analysis capabilities, and which handles 
the analysis of experiments with both fixed and random factors (including the mixed 
model) quite nicely. Design-Expert is a package focused exclusively on experimental 
design. It has many capabilities for construction and evaluation of designs and extensive 
analysis features. A student version of Design-Expert is available with this book, and its 
use is highly recommended. I urge all instructors who use this book to incorporate com- 
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puter software into your course. (In my course, I bring a laptop computer and an overhead 
display panel to every lecture, and every design or analysis topic discussed in class is 
illustrated with the computer.) 

I have increased the emphasis on the connection between the experiment and the 
model that the experimenter can develop from the results of the experiment. Engineers 
(and physical and chemical scientists to a large extent) learn about physical mechanisms 
and their underlying mechanistic models early in their academic training, and throughout 
much of their professional careers they are involved with manipulation of these models. 
Statistically designed experiments offer the engineer a valid basis for developing an 
empirical model of the system being investigated. This empirical model can then be 
manipulated (perhaps through a response surface or contour plot, or perhaps mathemat¬ 
ically) just as any other engineering model. I have discovered through many years of 
teaching that this viewpoint is very effective in creating enthusiasm in the engineering 
community for statistically designed experiments. Therefore, the notion of an underlying 
empirical model for the experiment and response surfaces appears early in the book and 
receives much more emphasis. 

I have also made an effort to get the reader to the critical topics involving factorial 
designs much faster. To facilitate this, the introductory material on completely random¬ 
ized single-factor experiments and the analysis of variance has been condensed into a 
single chapter (Chapter 3). I have expanded the material on factorial and fractional fac¬ 
torial designs (Chapters 5—9) in an effort to make the material flow more effectively 
from both the reader’s and the instructor’s viewpoint and to place more emphasis on the 
empirical model. The chapter on response surfaces (Chapter 11) immediately follows the 
material on factorial and fractional factorial designs and regression modeling. I have 
expanded this chapter, adding new material on alphabetically optimal designs, experi¬ 
ments with mixtures, and the robust parameter design problem. Chapters 12 and 13 
discuss experiments involving random effects and some applications of these concepts 
to nested and split-plot designs. Chapter 14 is an overview of important design and 
analysis topics: nonnormality of the response, the Box-Cox method for selecting the 
form of a transformation, and other alternatives; unbalanced factorial experiments; the 
analysis of covariance, including covariates in a factorial design, and repeated measures. 

Throughout the book I have stressed the importance of experimental design as a tool 
for practicing engineers to use for product design and development as well as process 
development and improvement. The use of experimental design in developing products 
that are robust to environmental factors and other sources of variability is illustrated. I 
believe that the use of experimental design early in the product cycle can substantially 
reduce development lead time and cost, leading to processes and products that perform 
better in the field and have higher reliability than those developed using other approaches. 

The book contains more material than can be covered comfortably in one course, 
and I hope that instructors will be able to either vary the content of each course offering 
or discuss some topics in greater depth, depending on class interest. There are problem 
sets at the end of each chapter (except Chapter 1). These problems vary in scope from 
computational exercises, designed to reinforce the fundamentals, to extensions or elab¬ 
oration of basic principles. 

My own course focuses extensively on factorial and fractional factorial designs. 
Consequently, I usually cover Chapter 1, Chapter 2 (very quickly), most of Chapter 3, 
Chapter 4 (excluding the material on incomplete blocks and only mentioning Latin 
squares briefly), and I discuss Chapters 5 through 8 on factorials and two-level factorial 
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and fractional factorial designs in detail. To conclude the course, I introduce response 
surface methodology (Chapter 11) and give an overview of random effects models (Chap¬ 
ter 12) and nested and split-plot designs (Chapter 13). I always require the students to 
complete a term project that involves designing, conducting, and presenting the results 
of a statistically designed experiment. I require them to do this in teams, because this is 
the way that much industrial experimentation is conducted. They must present the results 
of this project, both orally and in written form. 


THE SUPPLEMENTAL TEXT MATERIAL 

For the fifth edition I have prepared supplemental text material for each chapter of the 
book. Often, this supplemental material elaborates on topics that could not be discussed 
in greater detail in the book. I have also presented some subjects that do not appear 
directly in the book, but an introduction to them could prove useful to some students and 
professional practitioners. Some of this material is at a higher mathematical level than 
the text. I realize that instructors use this book with a wide array of audiences, and some 
more advanced design courses could possibly benefit from including several of the sup¬ 
plemental text material topics. This material is in electronic form on the Instructor’s 
CD/ROM and is on the World Wide Web site for this book. 


WEB SITE 

Current supporting material for instructors and students is available at the Web site 
www.wiley.com/college/montgomery. This site will be used to communicate information 
about innovations and recommendations for effectively using this text. The supplemental 
text material described above is available at the site, along with electronic versions of 
data sets used for examples and homework problems, a course syllabus, a complete set 
of Power Point lecture slides created by the author, and student term projects from the 
course at Arizona State University. 
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Introduction 



1 -1 STRATEGY OF EXPERIMENTATION 

Experiments are performed by investigators in virtually all fields of inquiry, usually to 
discover something about a particular process or system. Literally, an experiment is a 
test. More formally, we can define an experiment as a test or series of tests in which 
purposeful changes are made to the input variables of a process or system so that we may 
observe and identify the reasons for changes that may be observed in the output response. 

This book is about planning and conducting experiments and about analyzing the 
resulting data so that valid and objective conclusions are obtained. Our focus is on ex¬ 
periments in the engineering, physical, and chemical sciences. In engineering, experi¬ 
mentation plays an important role in new product design, manufacturing process 
development, and process improvement. The objective in many cases may be to develop 
a robust process, that is, a process affected minimally by external sources of variability. 

As an example of an experiment, suppose that a metallurgical engineer is interested 
in studying the effect of two different hardening processes, oil quenching and saltwater 
quenching, on an aluminum alloy. Here the objective of the experimenter is to determine 
which quenching solution produces the maximum hardness for this particular alloy. The 
engineer decides to subject a number of alloy specimens or test coupons to each quench¬ 
ing medium and measure the hardness of the specimens after quenching. The average 
hardness of the specimens treated in each quenching solution will be used to determine 
which solution is best. 

As we consider this simple experiment, a number of important questions come to 
mind: 

1. Are these two solutions the only quenching media of potential interest? 

2. Are there any other factors that might affect hardness that should be investigated 
or controlled in this experiment? 

3. How many coupons of alloy should be tested in each quenching solution? 

4. How should the test coupons be assigned to the quenching solutions, and in what 
order should the data be collected? 

1 
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5. What method of data analysis should be used? 

6. What difference in average observed hardness between the two quenching media 
will be considered important? 

All of these questions, and perhaps many others, will have to be answered satisfactorily 
before the experiment is performed. 

In any experiment, the results and conclusions that can be drawn depend to a large 
extent on the manner in which the data were collected. To illustrate this point, suppose 
that the metallurgical engineer in the above experiment used specimens from one heat 
in the oil quench and specimens from a second heat in the saltwater quench. Now, when 
the mean hardness is compared, the engineer is unable to say how much of the observed 
difference is the result of the quenching media and how much is the result of inherent 
differences between the heats. 1 Thus, the method of data collection has adversely affected 
the conclusions that can be drawn from the experiment. 

In general, experiments are used to study the performance of processes and systems. 
The process or system can be represented by the model shown in Figure 1-1. We can 
usually visualize the process as a combination of machines, methods, people, and other 
resources that transforms some input (often a material) into an output that has one or 
more observable responses. Some of the process variables x u x 2 ,... ,x p are controllable, 
whereas other variables z t , z 2 ,..., z q are uncontrollable (although they may be control¬ 
lable for purposes of a test). The objectives of the experiment may include the following: 

1. Determining which variables are most influential on the response y 

2. Determining where to set the influential x’s so that y is almost always near the 
desired nominal value 

3. Determining where to set the influential x’s so that variability in y is small. 

4. Determining where to set the influential x’s so that the effects of the 
uncontrollable variables z u z 2 ,..., z q are minimized. 

As you can see from the foregoing discussion, experiments often involve several 
factors. Usually, an objective of the person conducting the experiment, called the ex¬ 
perimenter, is to determine the influence that these factors have on the output response 


Controllable factors 





Uncontrollable factors 


Figure 1-1 General model of a process or 
system. 


1 A specialist in experimental design would say that the effects of quenching media and heat were confounded ; that 
is, the effects of these two factors cannot he separated. 
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of the system. The general approach to planning and conducting the experiment is called 
the strategy of experimentation. There are several strategies that an experimenter could 
use. We will illustrate some of these with a very simple example. 

The author really likes to play golf. Unfortunately, he does not enjoy practicing, so 
he is always looking for a simpler solution to lowering his score. Some of the factors 
that he thinks may be important, or that may in fluence his golf score, are as follows: 

1. The type of driver used (oversized or regular-sized) 

2. The type of ball used (balata or three-piece) 

3. Walking and carrying the golf clubs or riding in a golf cart 

4. Drinking water or drinking beer while playing 

5. Playing in the morning or playing in the afternoon 

6. Playing when it is cool or playing when it is hot 

7. The type of golf shoe spike worn (metal or soft) 

8. Playing on a windy day or playing on a calm day 

There are obviously many other factors that could be considered, but let’s assume that 
these are the ones of primary interest. Furthermore, based on long experience with the 
game, he decides that factors 5 through 8 can be ignored; that is, these factors are not 
important because their effects are so small that they have no practical value. Engineers 
and scientists often must make these types of decisions about some of the factors they 
are considering in real experiments. 

Now, let’s consider how factors 1 through 4 could be experimentally tested to de¬ 
termine their effect on the author’s golf score. Suppose that a maximum of eight rounds 
of golf can be played over the course of the experiment. One approach would be to select 
an arbitrary combination of these factors, test them, and see what happens. For example, 
suppose the oversized driver, balata ball, golf cart, and water combination is selected, 
and the resulting score is 87. During the round, however, the author noticed several 
wayward shots with the big driver (long is not always good in golf), and, as a result, he 
decides to play another round with the regular-sized driver, holding the other factors at 
the same levels used previously. This approach could be continued almost indefinitely, 
switching the levels of one (or perhaps two) factors for the next test, based on the outcome 
of the current test. This strategy of experimentation, which we call the best-guess ap¬ 
proach, is used frequently in practice by engineers and scientists. It often works reason¬ 
ably well, too, because the experimenters often have a great deal of technical or 
theoretical knowledge of the system they are studying, as well as considerable practical 
experience. There are, however, at least two disadvantages of the best-guess approach. 
First, suppose the initial best-guess does not produce the desired results. Now the ex¬ 
perimenter has to take another guess at the correct combination of factor levels. This 
could continue for a long time, without any guarantee of success. Second, suppose the 
initial best-guess produces an acceptable result. Now the experimenter is tempted to stop 
testing, although there is no guarantee that the best solution has been found. 

Another strategy of experimentation that is used extensively in practice is the one- 
factor-at-a-time approach. This method consists of selecting a starting point, or baseline 
set of levels, for each factor, then successively varying each factor over its range with 
the other factors held constant at the baseline level. After all tests are performed, a series 
of graphs are usually constructed showing how the response variable is affected by vary- 
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Figure 1-2 Results of the one-factor-at-a-time strategy for the golf experiment. 
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ing each factor with all other factors held constant. Figure 1-2 shows a set of these graphs 
for the golf experiment, using the oversized driver, balata ball, walking, and drinking 
water levels of the four factors as the baseline. The interpretation of this graph is straight¬ 
forward; for example, because the slope of the mode of travel curve is negative, we 
would conclude that riding improves the score. Using these one-factor-at-a-time graphs, 
we would select the optimal combination to be the regular-sized driver, riding, and drink¬ 
ing water. The type of golf ball seems unimportant. 

The major disadvantage of the one-factor-at-a-time strategy is that it fails to consider 
any possible interaction between the factors. An interaction is the failure of the one 
factor to produce the same effect on the response at different levels of another factor. 
Figure 1-3 shows an interaction between the type of driver and the beverage factors for 
the golf experiment. Notice that if the author uses the regular-sized driver, the type of 
beverage consumed has virtually no effect on his score, but if he uses the oversized 
driver, much better results are obtained by drinking water instead of beer. Interactions 
between factors are very common, and if they occur, the one-factor-at-a-time strategy 
will usually produce poor results. Many people do not recognize this, and, consequently, 
one-factor-at-a-time experiments are run frequently in practice. (Some individuals actu¬ 
ally think that this strategy is related to the scientific method or that it is a “sound” 
engineering principle.) One-factor-at-a-time experiments are always less efficient than 
other methods based on a statistical approach to design. We will discuss this in more 
detail in Chapter 5. 

The correct approach to dealing with several factors is to conduct a factorial ex¬ 
periment. This is an experimental strategy in which factors are varied together, instead 
of one at a time. The factorial experimental design concept is extremely important, and 



Figure 1-3 Interaction between type 
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golf experiment. 
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several chapters in this book are devoted to presenting basic factorial experiments and a 
number of useful variations and special cases. 

To illustrate how a factorial experiment is conducted, consider the golf experiment 
and suppose that only two factors, type of driver and type of ball, are of interest. Figure 
1-4 shows a two-factor factorial experiment for studying the joint effects of these two 
factors on the author’s golf score. Notice that this factorial experiment has both factors 
at two levels and that all possible combinations of the two factors across their levels are 
used in the design. Geometrically, the four runs form the comers of a square. This 
particular type of factorial experiment is called a 2 2 factorial design (two factors, each 
at two levels). Because the author can reasonably expect to play eight rounds of golf to 
investigate these factors, a reasonable plan would be to play two rounds of golf at each 
combination of factor levels shown in Figure 1-4. An experimental designer would say 
that we have replicated the design twice. This experimental design would enable the 
experimenter to investigate the individual effects of each factor (or the main effects) and 
to determine whether the factors interact. 

Figure l-5a shows the results of performing the factorial experiment in Figure 1-4. 
The scores from each round of golf played at the four test combinations are shown at 
the comers of the square. Notice that there are four rounds of golf that provide infor¬ 
mation about using the regular-sized driver and four rounds that provide information 
about using the oversized driver. By finding the average difference in the scores on the 
right- and left-hand sides of the square (as in Figure 1-56), we have a measure of the 
effect of switching from the oversized driver to the regular-sized driver, or 

^ 92 + 94 + 93 + 91 88 + 91 + 88 + 90 

Driver Effect =----- 
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= 3.25 
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Figure 1-5 Scores from the golf experiment in Figure 1-4 and calculation of the factor effects. 
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That is, on average, switching from the oversized to the regular-sized driver increases 
the score by 3.25 strokes per round. Similarly, the average difference in the four scores 
at the top of the square and the four scores at the bottom measure the effect of the type 
of ball used (see Figure l-5c): 


1 + 91+92 + 94 


1 + 90 + 93 + 91 


Finally, a measure of the interaction effect between the type of ball and the type of driver 
can be obtained by subtracting the average scores on the left-to-right diagonal in the 
square from the average scores on the right-to-left diagonal (see Figure \-5d), resulting 
in 


Ball-Driver Interaction Effect = 


92 + 94 + 88 + 90 
4 


= 0.25 


88 + 91 + 93 + 91 
4 


The results of this factorial experiment indicate that driver effect is larger than either 
the ball effect or the interaction. Statistical testing could be used to determine whether 
any of these effects differ from zero. In fact, it turns out that there is reasonably strong 
statistical evidence that the driver effect differs from zero and the other two effects do 
not. Therefore, perhaps the author should always play with the oversized driver. 

One very important feature of the factorial experiment is evident from this simple 
example; namely, factorials make the most efficient use of the experimental data. Notice 
that this experiment included eight observations, and all eight observations are used to 
calculate the driver, the ball, and interaction effects. No other strategy of experimentation 
makes so efficient use of the data. This is an important and useful feature of factorials. 

We can extend the factorial experiment concept to three factors. Suppose that the 
author wishes to study the effects of type of driver, type of ball, and the type of beverage 
consumed on his golf score. Assuming that all three factors have two levels, a factorial 
design can be set up as shown in Figure 1-6. Notice that there are eight test combinations 
of these three factors across the two levels of each and that these eight trials can be 
represented geometrically as the comers of a cube. This is an example of a 2 3 factorial 
design. Because the author only wants to play eight rounds of golf, this experiment would 
require that one round be played at each combination of factors represented by the eight 
comers of the cube in Figure 1-6. However, if we compare this to the two-factor factorial 
in Figure 1-4, the 2 3 factorial design would provide the same information about the factor 



Figure 1-6 A three-factor factorial experiment involving 
type of driver, type of ball, and type of beverage. 






STRATEGY OF EXPERIMENTATION 


7 


Mode of travel 




Figure 1-7 A four-factor factorial experiment involving type of driver, type of 
ball, type of beverage, and mode of travel. 


effects. For example, in both designs there are four tests that provide information about 
the regular-sized driver and four tests that provide information about the oversized driver, 
assuming that each run in the two-factor design in Figure 1-4 is replicated twice. 

Figure 1-7 illustrates how all four factors—driver, ball, beverage, and mode of travel 
(walking or riding)—could be investigated in a 2 4 factorial design. As in any factorial 
design, all possible combinations of the levels of the factors are used. Because all four 
factors are at two levels, this experimental design can still be represented geometrically 
as a cube (actually a hypercube). 

Generally, if there are k factors, each at two levels, the factorial design would require 
2 k runs. For example, the experiment in Figure 1-7 requires 16 runs. Clearly, as the 
number of factors of interest increases, the number of runs required increases rapidly; 
for instance, a 10-factor experiment with all factors at two levels would require 1024 
runs. This quickly becomes infeasible from a time and resource viewpoint. In the golf 
experiment, the author can only play eight rounds of golf, so even the experiment in 
Figure 1-7 is too large. 

Fortunately, if there are four to five or more factors, it is usually unnecessary to run 
all possible combinations of factor levels. A fractional factorial experiment is a vari¬ 
ation of the basic factorial design in which only a subset of the runs are made. Figure 
1-8 shows a fractional factorial design for the four-factor version of the golf experiment. 
This design requires only 8 runs instead of the original 16 and would be called a one- 
half fraction. If the author can only play eight rounds of golf, this is an excellent design 
in which to study all four factors. It will provide good information about the main effects 
of the four factors as well as some information about how these factors interact. 

Fractional factorial designs are used extensively in industrial research and devel¬ 
opment, and for process improvement. These designs will be discussed in Chapter 8. 
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Figure 1-8 A four-factor fractional factorial experiment involving type of driver, 
type of ball, type of beverage, and mode of travel. 
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1 -2 SOME TYPICAL APPLICATIONS OF EXPERIMENTAL DESIGN 

Experimental design methods have found broad application in many disciplines. In fact, 
we may view experimentation as part of the scientific process and as one of the ways 
we learn about how systems or processes work. Generally, we learn through a series of 
activities in which we make conjectures about a process, perform experiments to generate 
data from the process, and then use the information from the experiment to establish new 
conjectures, which lead to new experiments, and so on. 

Experimental design is a critically important tool in the engineering world for im¬ 
proving the performance of a manufacturing process. It also has extensive application in 
the development of new processes. The application of experimental design techniques 
early in process development can result in 

1. Improved process yields 

2. Reduced variability and closer conformance to nominal or target requirements 

3. Reduced development time 

4. Reduced overall costs 

Experimental design methods also play a major role in engineering design activities, 
where new products are developed and existing ones improved. Some applications of 
experimental design in engineering design include 

1. Evaluation and comparison of basic design configurations 

2. Evaluation of material alternatives 

3. Selection of design parameters so that the product will work well under a wide 
variety of field conditions, that is, so that the product is robust 

4. Determination of key product design parameters that impact product 
performance 

The use of experimental design in these areas can result in products that are easier to 
manufacture, products that have enhanced field performance and reliability, lower prod¬ 
uct cost, and shorter product design and development time. We now present several 
examples that illustrate some of these ideas. 

EXAMPLE 1-1 . 

Characterizing a Process 

A flow solder machine is used in the manufacturing process for printed circuit boards. 
The machine cleans the boards in a flux, preheats the boards, and then moves them along 
a conveyor through a wave of molten solder. This solder process makes the electrical 
and mechanical connections for the leaded components on the board. 

The process currently operates around the 1 percent defective level. That is, about 
1 percent of the solder joints on a board are defective and require manual retouching. 
However, because the average printed circuit board contains over 2000 solder joints, 
even a 1 percent defective level results in far too many solder joints requiring rework. 
The process engineer responsible for this area would like to use a designed experiment 
to determine which machine parameters are influential in the occurrence of solder defects 
and which adjustments should be made to those variables to reduce solder defects. 
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The flow solder machine has severed variables that can be controlled. They include 

1. Solder temperature 

2. Preheat temperature 

3. Conveyor speed 

4. Flux type 

5. Flux specific gravity 

6. Solder wave depth 

7. Conveyor angle 

In addition to these controllable factors, there are several other factors that cannot be 
easily controlled during routine manufacturing, although they could be controlled for the 
purposes of a test. They are 

1. Thickness of the printed circuit board 

2. Types of components used on the board 

3. Layout of the components on the board 

4. Operator 

5. Production rate 

In this situation, the engineer is interested in characterizing the flow solder machine; 
that is, he wants to determine which factors (both controllable and uncontrollable) affect 
the occurrence of defects on the printed circuit boards. To accomplish this, he can design 
an experiment that will enable him to estimate the magnitude and direction of the factor 
effects; that is, how much does the response variable (defects per unit) change when each 
factor is changed, and does changing the factors together produce different results than 
are obtained from individual factor adjustments—that is, do the factors interact? Some¬ 
times we call an experiment such as this a screening experiment. Typically, screening 
or characterization experiments involve using fractional factorial designs, such as in the 
golf example in Figure 1-8. 

The information from this screening or characterization experiment will be used to 
identify the critical process factors and to determine the direction of adjustment for these 
factors to reduce further the number of defects per unit. The experiment may also provide 
information about which factors should be more carefully controlled during routine man¬ 
ufacturing to prevent high defect levels and erratic process performance. Thus, one result 
of the experiment could be the application of techniques such as control charts to one or 
more process variables (such as solder temperature), in addition to control charts on 
process output. Over time, if the process is improved enough, it may be possible to base 
most of the process control plan on controlling process input variables instead of control 
charting the output. 


EXAMPLE 1-2 . 

Optimizing a Process 

In a characterization experiment, we are usually interested in determining which process 
variables affect the response. A logical next step is to optimize, that is, to determine the 
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region in the important factors that leads to the best possible response. For example, if 
the response is yield, we would look for a region of maximum yield, whereas if the 
response is variability in a critical product dimension, we would seek a region of mini¬ 
mum variability. 

Suppose that we are interested in improving the yield of a chemical process. We 
know from the results of a characterization experiment that the two most important 
process variables that influence the yield are operating temperature and reaction time. 
The process currently runs at 145°F and 2.1 hours of reaction time, producing yields of 
around 80 percent. Figure 1-9 shows a view of the time-temperature region from above. 
In this graph, the lines of constant yield are connected to form response contours, and 
we have shown the contour lines for yields of 60, 70, 80, 90, and 95 percent. These 
contours are projections on the time—temperature region of cross sections of the yield 
surface corresponding to the aforementioned percent yields. This surface is sometimes 
called a response surface. The true response surface in Figure 1-9 is unknown to the 
process personnel, so experimental methods will be required to optimize the yield with 
respect to time and temperature. 

To locate the optimum, it is necessary to perform an experiment that varies time and 
temperature together, that is, a factorial experiment. The results of an initial factorial 
experiment with both time and temperature run at two levels is shown in Figure 1-9. The 
responses observed at the four comers of the square indicate that we should move in the 
general direction of increased temperature and decreased reaction time to increase yield. 



Time (hours) 

Figure 1-9 Contour plot of yield as a function of reaction time 
and reaction temperature, illustrating experimentation to optimize a 
process. 
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A few additional runs would be performed in this direction, and this additional experi¬ 
mentation would lead us to the region of maximum yield. 

Once we have found the region of the optimum, a second experiment would typically 
be performed. The objective of this second experiment is to develop an empirical model 
of the process and to obtain a more precise estimate of the optimum operating conditions 
for time and temperature. This approach to process optimization is called response sur¬ 
face methodology, and it is explored in detail in Chapter 11. The second design illus¬ 
trated in Figure 1-9 is a central composite design, one of the most important 
experimental designs used in process optimization studies. 


EXAMPLE 1-3 . 

A Product Design Example 

Experimental design methods can often be applied to the product design process. To 
illustrate, suppose that a group of engineers is designing a door hinge for an automobile. 
The quality characteristic of interest is the check effort, or the holding ability of the door 
latch that prevents the door from swinging closed when the vehicle is parked on a hill. 
The check mechanism consists of a leaf spring and a roller. When the door is opened, 
the roller travels through an arc, causing the leaf spring to be compressed. To close the 
door, the spring must be forced aside, and this creates the check effort. The engineering 
team thinks that check effort is a function of the following factors: 

1. Roller travel distance 

2. Spring height from pivot to base 

3. Horizontal distance from pivot to spring 

4. Free height of the reinforcement spring 

5. Free height of the main spring 

The engineers can build a prototype hinge mechanism in which all of these factors 
can be varied over certain ranges. Once appropriate levels for these five factors have 
been identified, an experiment can be designed consisting of various combinations of the 
factor levels, and the prototype hinge can be tested at these combinations. This will 
produce information concerning which factors are most influential on the latch check 
effort, and through analysis of this information, the latch design can be improved. 


1-3 BASIC PRINCIPLES 

If an experiment such as the ones described in Examples 1-1 through 1-3 is to be per¬ 
formed most efficiently, a scientific approach to planning the experiment must be em¬ 
ployed. Statistical design of experiments refers to the process of planning the 
experiment so that appropriate data that can be analyzed by statistical methods will be 
collected, resulting in valid and objective conclusions. The statistical approach to exper¬ 
imental design is necessary if we wish to draw meaningful conclusions from the data. 
When the problem involves data that are subject to experimental errors, statistical meth¬ 
odology is the only objective approach to analysis. Thus, there are two aspects to any 
experimental problem: the design of the experiment and the statistical analysis of the 
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data. These two subjects are closely related because the method of analysis depends 
directly on the design employed. Both topics will be addressed in this book. 

The three basic principles of experimental design are replication, randomization, 
and blocking. By replication we mean a repetition of the basic experiment. In the met¬ 
allurgical experiment discussed in Section 1-1, replication would consist of treating a 
specimen by oil quenching and treating a specimen by saltwater quenching. Thus, if five 
specimens are treated in each quenching medium, we say that five replicates have been 
obtained. Replication has two important properties. First, it allows the experimenter to 
obtain an estimate of the experimental error. This estimate of error becomes a basic unit 
of measurement for determining whether observed differences in the data are really sta¬ 
tistically different. Second, if the sample mean (e.g., y) is used to estimate the effect of 
a factor in the experiment, replication permits the experimenter to obtain a more precise 
estimate of this effect. For example; if cr 2 is the variance of an individual observation 
and there are n replicates, the variance of the sample mean is 

, cr 2 

<T± = - 

n 

The practical implication of this is that if we had n = 1 replicates and observed 
y x = 145 (oil quench) and y 2 = 147 (saltwater quench), we would probably be unable 
to make satisfactory inferences about the effect of the quenching medium—that is, the 
observed difference could be the result of experimental error. On the other hand, if n 
was reasonably large, and the experimental error was sufficiently small and if we ob¬ 
served yi <y 2 , we would be reasonably safe in concluding that saltwater quenching 
produces a higher hardness in this particular aluminum alloy than does oil quenching. 

There is an important distinction between replication and repeated measurements. 
For example, suppose that a silicon wafer is etched in a single-wafer plasma etching 
process, and a critical dimension on this wafer is measured three times. These measure¬ 
ments are not replicates; they are a form of repeated measurements, and in this case, the 
observed variability in the three repeated measurements is a direct reflection of the in¬ 
herent variability in the measurement system or gauge. As another illustration, suppose 
that as part of an experiment in semiconductor manufacturing, four wafers are processed 
simultaneously in an oxidation furnace at a particular gas flow rate and time and then a 
measurement is taken on the oxide thickness of each wafer. Once again, the measurement 
on the four wafers are not replicates but repeated measurements. In this case they reflect 
differences among the wafers and other sources of variability within that particular fur¬ 
nace run. Replication reflects sources of variability both between runs and (potentially) 
within runs. 

Randomization is the cornerstone underlying the use of statistical methods in ex¬ 
perimental design. By randomization we mean that both the allocation of the experi¬ 
mental material and the order in which the individual runs or trials of the experiment are 
to be performed are randomly determined. Statistical methods require that the observa¬ 
tions (or errors) be independently distributed random variables. Randomization usually 
makes this assumption valid. By properly randomizing the experiment, we also assist in 
“averaging out” the effects of extraneous factors that may be present. For example, 
suppose that the specimens in the above experiment are of slightly different thicknesses 
and that the effectiveness of the quenching medium may be affected by specimen thick¬ 
ness. If all the specimens subjected to the oil quench are thicker than those subjected to 
the saltwater quench, we may be introducing systematic bias into the experimental results. 
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This bias handicaps one of the quenching media and consequently invalidates our results. 
Randomly assigning the specimens to the quenching media alleviates this problem. 

Computer software programs are widely used to assist experimenters in selecting 
and constructing experimental designs. These programs often present the runs in the 
experimental design in random order. This random order is usually created by using a 
random number generator. Even with such a computer program, it is still often necessary 
to assign experimental material (such as the wafers in the semiconductor examples men¬ 
tioned above), operators, gauges or measurement devices, and so forth, for use in the 
experiment. Tables of random numbers can be used to ensure that random assignments 
are made. 

Sometimes experimenters encounter situations where randomization of some aspect 
of the experiment is difficult. For example, in a chemical process, temperature may be a 
very hard-to-change variable, making complete randomization of this factor almost 
impossible. There are statistical design methods for dealing with restrictions on random¬ 
ization. Some of these approaches will be discussed in subsequent chapters (see in par¬ 
ticular, Chapter 13). 

Blocking is a design technique used to improve the precision with which compari¬ 
sons among the factors of interest are made. Often blocking is used to reduce or eliminate 
the variability transmitted from nuisance factors; that is, factors that may influence the 
experimental response but in which we are not directly interested. For example, an ex¬ 
periment in a chemical process may require two batches of raw material to make all the 
required runs. However, there could be differences between the batches due to supplier- 
to-supplier variability, and if we are not specifically interested in this effect, we would 
think of the batches of raw material as a nuisance factor. Generally, a block is a set of 
relatively homogeneous experimental conditions. In the chemical process example, each 
batch of raw material would form a block, because the variability within a batch would 
be expected to be smaller than the variability between batches. Typically, as in this 
example, each level of the nuisance factor becomes a block. Then the experimenter 
divides the observations from the statistical design into groups that are run in each block. 
We study blocking in detail in several places in the text, including Chapters 4, 5, 7, 8, 
9, 11, and 13. A simple example illustrating the blocking principal is given in Chapter 
2, Section 2-5.1. 

The three basic principles of experimental design, randomization, replication, and 
blocking are part of every experiment. We will illustrate and emphasize them repeatedly 
throughout this book. 


1-4 GUIDELINES FOR DESIGNING EXPERIMENTS 

To use the statistical approach in designing and analyzing an experiment, it is necessary 
for everyone involved in the experiment to have a clear idea in advance of exactly what 
is to be studied, how the data are to be collected, and at least a qualitative understanding 
of how these data are to be analyzed. An outline of the recommended procedure is shown 
in Table 1 -1 on the next page. We now give a brief discussion of this outline and elaborate 
on some of the key points. For more details, see Coleman and Montgomery (1993), and 
the references therein. The supplemental text material for this chapter is also useful. 


1. Recognition of and statement of the problem. This may seem to be a rather obvious 
point, but in practice it is often not simple to realize that a problem requiring experi- 
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Table 1-1 Guidelines for Designing an Experiment 

1. Recognition of and statement of the problem. 1 

2. Choice of factors, levels, and ranges." Pre-expenmental 

3. Selection of the response variable." J P lannm 8 

4. Choice of experimental design. 

5. Performing the experiment. 

6. Statistical analysis of the data. 

7. Conclusions and recommendations. 

“ In practice, steps 2 and 3 are often done simultaneously or in reverse order. 

mentation exists, nor is it simple to develop a clear and generally accepted statement of 
this problem. It is necessary to develop all ideas about the objectives of the experiment. 
Usually, it is important to solicit input from all concerned parties: engineering, quality 
assurance, manufacturing, marketing, management, the customer, and operating person¬ 
nel (who usually have much insight and who are too often ignored). For this reason a 
team approach to designing experiments is recommended. 

It is usually helpful to prepare a list of specific problems or questions that are to be 
addressed by the experiment. A clear statement of the problem often contributes sub¬ 
stantially to better understanding of the phenomenon being studied and the final solution 
of the problem. It is also important to keep the overall objective in mind; for example, 
is this a new process or system—in which case the initial objective is likely to be char¬ 
acterization or factor screening—or is it a mature or reasonably well-understood system 
that has been previously characterized—in which case the objective may be optimiza¬ 
tion? There are many possible objectives of an experiment, including confirmation (is 
the system performing the same way now that it did in the past?), discovery (what 
happens if we explore new materials, variables, operating conditions, etc.?), and stability 
(under what conditions do the response variables of interest seriously degrade?). Obvi¬ 
ously, the specific questions to be addressed in the experiment relate directly to the overall 
objectives. Often at this stage of problem formulation many engineers and scientists 
realize that one large comprehensive experiment is unlikely to answer the key questions 
and that a sequential approach using a series of smaller experiments is a better strategy. 

2. Choice of factors, levels, and range. (As noted in Table 1-1, steps 2 and 3 are often 
done simultaneously, or in the reverse order.) When considering the factors that may 
influence the performance of a process or system, the experimenter usually discovers that 
these factors can be classified as either potential design factors or nuisance factors. The 
potential design factors are those factors that the experimenter may wish to vary in the 
experiment. Often we find that there are a lot of potential design factors, and some further 
classification of them is helpful. Some useful classifications are design factors, held- 
constant factors, and allowed-to-vary factors. The design factors are the factors actually 
selected for study in the experiment. Held-constant factors are variables that may exert 
some effect on the response, but for purposes of the present experiment these factors are 
not of interest, so they will be held at a specific level. For example, in an etching ex¬ 
periment in the semiconductor industry there may be an effect that is unique to the 
specific plasma etch tool used in the experiment. However, this factor would be very 
difficult to vary in an experiment, so the experimenter may decide to perform all exper¬ 
imental runs on one particular (ideally “typical”) etcher. Thus this factor has been held 
constant. As an example of allowed-to-vary factors, the experimental units or the “ma¬ 
terials” to which the design factors are applied are usually nonhomogeneous, yet we 
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often ignore this unit-to-unit variability and rely on randomization to balance out any 
material or experimental unit effect. We often assume that the effects of held-constant 
factors and allowed-to-vary factors are relatively small. 

Nuisance factors, on the other hand, may have large effects that must be accounted 
for, yet we may not be interested in them in the context of the present experiment. 
Nuisance factors are often classified as controllable, uncontrollable, or noise factors. 
A controllable nuisance factor is one whose levels may be set by the experimenter. For 
example, the experimenter can select different batches of raw material or different days 
of the week when conducting the experiment. The blocking principal, discussed in the 
previous section, is often useful in dealing with controllable nuisance factors. If a nui¬ 
sance factor is uncontrollable in the experiment, but it can be measured, an analysis 
procedure called the analysis of covariance can often be used to compensate for its 
effect. For example, the relative humidity in the process environment may affect process 
performance, and if the humidity cannot be controlled, it probably can be measured and 
treated as a covariate. When a factor that varies naturally and uncontrollably in the 
process can be controlled for purposes of an experiment, we often call it a noise factor. 
In such situations, our objective is usually to find the settings of the controllable design 
factors that minimize the variability transmitted from the noise factors. This is sometimes 
called a process robustness study or a robust design problem. Blocking, analysis of 
covariance, and process robustness studies are discussed later in the text. 

Once the experimenter has selected the design factors, he or she must choose the 
ranges over which these factors will be varied, and the specific levels at which runs will 
be made. Thought must also be given to how these factors are to be controlled at the 
desired values and how they are to be measured. For instance, in the flow solder exper¬ 
iment, the engineer has defined 12 variables that may affect the occurrence of solder 
defects. The engineer will also have to decide on a region of interest for each variable 
(that is, the range over which each factor will be varied) and on how many levels of each 
variable to use. Process knowledge is required to do this. This process knowledge is 
usually a combination of practical experience and theoretical understanding. It is impor¬ 
tant to investigate all factors that may be of importance and to not be overly influenced 
by past experience, particularly when we are in the early stages of experimentation or 
when the process is not very mature. 

When the objective of the experiment is factor screening or process characterization, 
it is usually best to keep the number of factor levels low. Generally, two levels work 
very well in factor screening studies. Choosing the region of interest is also important. 
In factor screening, the region of interest should be relatively large—that is, the range 
over which the factors are varied should be broad. As we learn more about which vari¬ 
ables are important and which levels produce the best results, the region of interest will 
usually become narrower. 

3. Selection of the response variable. In selecting the response variable, the experi¬ 
menter should be certain that this variable really provides useful information about the 
process under study. Most often, the average or standard deviation (or both) of the mea¬ 
sured characteristic will be the response variable. Multiple responses are not unusual. 
Gauge capability (or measurement error) is also an important factor. If gauge capability 
is inadequate, only relatively large factor effects will be detected by the experiment or 
perhaps additional replication will be required. In some situations where gauge capability 
is poor, the experimenter may decide to measure each experimental unit several times 
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and use the average of the repeated measurements as the observed response. It is usually 
critically important to identify issues related to defining the responses of interest and 
how they are to be measured before conducting the experiment. Sometimes designed 
experiments are employed to study and improve the performance of measurement sys¬ 
tems. For an example, see Chapter 12. 

We reiterate how crucial it is to bring out all points of view and process information 
in steps 1 through 3 above. We refer to this as pre-experimental planning. Coleman 
and Montgomery (1993) provide worksheets that can be useful in pre-experimental plan¬ 
ning. Also see the supplemental text information for more details and an example of 
using these worksheets. It is unlikely that one person has all the knowledge required to 
do this adequately in many situations. Therefore, we argue strongly for a team effort 
in planning the experiment. Most of your success will hinge on how well the pre- 
experimental planning is done. 

4. Choice of experimental design. If the pre-experimental planning activities above 
are done correctly, this step is relatively easy. Choice of design involves the consideration 
of sample size (number of replicates), the selection of a suitable run order for the ex¬ 
perimental trials, and the determination of whether or not blocking or other randomization 
restrictions are involved. This book discusses some of the more important types of ex¬ 
perimental designs, and it can ultimately be used as a catalog for selecting an appropriate 
experimental design for a wide variety of problems. 

There are also several interactive statistical software packages that support this phase 
of experimental design. The experimenter can enter information about the number of 
factors, levels, and ranges, and these programs will either present a selection of designs 
for consideration or recommend a particular design. (We prefer to see several alternatives 
instead of relying on a computer recommendation in most cases.) These programs will 
usually also provide a worksheet (with the order of the runs randomized) for use in 
conducting the experiment. 

In selecting the design, it is important to keep the experimental objectives in mind. 
In many engineering experiments, we already know at the outset that some of the factor 
levels will result in different values for the response. Consequently, we are interested in 
identifying which factors cause this difference and in estimating the magnitude of the 
response change. In other situations, we may be more interested in verifying uniformity. 
For example, two production conditions A and B may be compared, A being the standard 
and B being a more cost-effective alternative. The experimenter will then be interested 
in demonstrating that, say, there is no difference in yield between the two conditions. 

5. Performing the experiment. When running the experiment, it is vital to monitor the 
process carefully to ensure that everything is being done according to plan. Errors in 
experimental procedure at this stage will usually destroy experimental validity. Up-front 
planning is crucial to success. It is easy to underestimate the logistical and planning 
aspects of running a designed experiment in a complex manufacturing or research and 
development environment. 

Coleman and Montgomery (1993) suggest that prior to conducting the experiment a 
few trial runs or pilot runs are often helpful. These runs provide information about con¬ 
sistency of experimental material, a check on the measurement system, a rough idea of 
experimental error, and a chance to practice the overall experimental technique. This 
also provides an opportunity to revisit the decisions made in steps 1-4, if necessary. 
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6. Statistical analysis of the data. Statistical methods should be used to analyze the 
data so that results and conclusions are objective rather than judgmental in nature. If the 
experiment has been designed correctly and if it has been performed according to 
the design, the statistical methods required are not elaborate. There are many excellent 
software packages designed to assist in data analysis, and many of the programs used in 
step 4 to select the design provide a seamless, direct interface to the statistical analysis. 
Often we find that simple graphical methods play an important role in data analysis and 
interpretation. Because many of the questions that the experimenter wants to answer can 
be cast into an hypothesis-testing framework, hypothesis testing and confidence interval 
estimation procedures are very useful in analyzing data from a designed experiment. It 
is also usually very helpful to present the results of many experiments in terms of an 
empirical model, that is, an equation derived from the data that expresses the relationship 
between the response and the important design factors. Residual analysis and model 
adequacy checking are also important analysis techniques. We will discuss these issues 
in detail later. 

Remember that statistical methods cannot prove that a factor (or factors) has a partic¬ 
ular effect. They only provide guidelines as to the reliability and validity of results. 
Properly applied, statistical methods do not allow anything to be proved experimentally, 
but they do allow us to measure the likely error in a conclusion or to attach a level of 
confidence to a statement. The primary advantage of statistical methods is that they add 
objectivity to the decision-making process. Statistical techniques coupled with good en¬ 
gineering or process knowledge and common sense will usually lead to sound 
conclusions. 

7. Conclusions and recommendations. Once the data have been analyzed, the exper¬ 
imenter must draw practical conclusions about the results and recommend a course of 
action. Graphical methods are often useful in this stage, particularly in presenting the 
results to others. Follow-up runs and confirmation testing should also be performed to 
validate the conclusions from the experiment. 

Throughout this entire process, it is important to keep in mind that experimentation 
is an important part of the learning process, where we tentatively formulate hypotheses 
about a system, perform experiments to investigate these hypotheses, and on the basis 
of the results formulate new hypotheses, and so on. This suggests that experimentation 
is iterative. It is usually a major mistake to design a single, large, comprehensive ex¬ 
periment at the start of a study. A successful experiment requires knowledge of the 
important factors, the ranges over which these factors should be varied, the appropriate 
number of levels to use, and the proper units of measurement for these variables. Gen¬ 
erally, we do not perfectly know the answers to these questions, but we learn about them 
as we go along. As an experimental program progresses, we often drop some input 
variables, add others, change the region of exploration for some factors, or add new 
response variables. Consequently, we usually experiment sequentially, and as a general 
rule, no more than about 25 percent of the available resources should be invested in the 
first experiment. This will ensure that sufficient resources are available to perform con¬ 
firmation runs and ultimately accomplish the final objective of the experiment. 

1-5 A BRIEF HISTORY OF STATISTICAL DESIGN 

There have been four eras in the modem development of statistical experimental design. 
The agricultural era was led by the pioneering work of Sir Ronald A. Fisher in the 1920s 



18 


CHAPTER 1 INTRODUCTION 


and early 1930s. During this time, Fisher was responsible for statistics and data analysis 
at the Rothamsted Agricultural Experimental Station near London, England. Fisher rec¬ 
ognized that flaws in the way the experiment that generated the data had been performed 
often hampered the analysis of data from systems (in this case agricultural systems). By 
interacting with scientists and researchers in many fields he developed the insights that 
led to the three basic principles of experimental design that we discussed in Section 1-3: 
randomization, replication, and blocking. Fisher systematically introduced statistical 
thinking and principles into designing experimental investigations, including the factorial 
design concept and the analysis of variance. His two books [Fisher (1958, 1966)] had 
profound influence on the use of statistics, particularly in agricultural and related life 
sciences. For an excellent biography of Fisher, see Box (1978). 

Although applications of statistical design in industrial settings certainly began in 
the 1930s, the second, or industrial, era was catalyzed by the development of response 
surface methodology (RSM) by Box and Wilson (1951). They recognized and exploited 
the fact that many industrial experiments are fundamentally different from their agricul¬ 
tural counterparts in two ways: (1) the response variable can usually be observed (nearly) 
immediately, and (2) the experimenter can quickly learn crucial information from a small 
group of runs that can be used to plan the next experiment. Box (1999) calls these two 
features of industrial experiments immediacy and sequentiality. Over the next 30 years, 
RSM and other design techniques spread throughout the chemical and the process in¬ 
dustries, mostly in research and development work. George Box was the intellectual 
leader of this movement. However, the application of statistical design at the plant or 
manufacturing process level was still not extremely widespread. Some of the reasons for 
this include inadequate training in basic statistical concepts and methods for engineers 
and other process specialists and the lack of computing resources and user-friendly sta¬ 
tistical software to support the application of statistically designed experiments. 

The increasing interest of Western industry in quality improvement that began in the 
late 1970s ushered in the third era of statistical design. The work of Genichi Taguchi 
[Taguchi and Wu (1980), Kackar (1985), and Taguchi (1987, 1991)] had a si gnifi cant 
impact on expanding the interest in and use of designed experiments. Taguchi advocated 
using designed experiments for what he termed robust parameter design, or 

1. Making processes insensitive to environmental factors or other factors that are 
difficult to control 

2. Making products insensitive to variation transmitted from components 

3. Finding levels of the process variables that force the mean to a desired value 
while simultaneously reducing variability around this value 

Taguchi suggested highly fractionated factorial designs and other orthogonal arrays along 
with some novel statistical methods to solve these problems. The resulting methodology 
generated much discussion and controversy. Part of the controversy arose because Tag¬ 
uchi’s methodology was advocated in the West initially (and primarily) by entrepreneurs, 
and the underlying statistical science had not been adequately peer-reviewed. By the late 
1980s the results of peer review indicated that although Taguchi’s engineering concepts 
and objectives were well-founded, there were substantial problems with his experimental 
strategy and methods of data analysis. For specific details of these issues, see Box (1988), 
Box, Bisgaard, and Fung (1988), Hunter (1985, 1989), Myers and Montgomery (1995), 
and Pignatiello and Ramberg (1992). Many of these concerns are also summarized in 
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the extensive panel discussion in the May 1992 issue of Technometrics [see Nair et al. 
(1992)]. 

There were at least three positive outcomes of the Taguchi controversy. First, de¬ 
signed experiments became more widely used in the discrete parts industries, including 
automotive and aerospace manufacturing, electronics and semiconductors, and many 
other industries that had previously made little use of the technique. Second, the fourth 
era of statistical design began. This era has included a renewed general interest in sta¬ 
tistical design by both researchers and practitioners and the development of many new 
and useful approaches to experimental problems in the industrial world, including alter¬ 
natives to Taguchi’s technical methods that allow his engineering concepts to be carried 
into practice efficiently and effectively. Some of these alternatives will be discussed and 
illustrated in subsequent chapters, particularly Chapter 11. Third, formal education in 
statistical experimental design is becoming part of many engineering programs in uni¬ 
versities, at both undergraduate and graduate levels. The successful integration of good 
experimental design practice into engineering and science is a key factor in future in¬ 
dustrial competitiveness. 


1 -6 SUMMARY: USING STATISTICAL TECHNIQUES 
IN EXPERIMENTATION 

Much of the research in engineering, science, and industry is empirical and makes ex¬ 
tensive use of experimentation. Statistical methods can greatly increase the efficiency of 
these experiments and often strengthen the conclusions so obtained. The proper use of 
statistical techniques in experimentation requires that the experimenter keep the follow¬ 
ing points in mind: 

1. Use your nonstatistical knowledge of the problem. Experimenters are usually 
highly knowledgeable in their fields. For example, a civil engineer working on a problem 
in hydrology typically has considerable practical experience and formal academic train¬ 
ing in this area. In some fields there is a large body of physical theory on which to draw 
in explaining relationships between factors and responses. This type of nonstatistical 
knowledge is invaluable in choosing factors, determining factor levels, deciding how 
many replicates to run, interpreting the results of the analysis, and so forth. Using sta¬ 
tistics is no substitute for thinking about the problem. 

2. Keep the design and analysis as simple as possible. Don’t be overzealous in the 
use of complex, sophisticated statistical techniques. Relatively simple design and analysis 
methods are almost always best. This is a good place to reemphasize step 4 of the 
procedure recommended in Section 1-4. If you do the design carefully and correctly, the 
analysis will almost always be relatively straightforward. However, if you botch 
the design badly, it is unlikely that even the most complex and elegant statistics can save 
the situation. 

3. Recognize the difference between practical and statistical significance. Just be¬ 
cause two experimental conditions produce mean responses that are statistically different, 
there is no assurance that this difference is large enough to have any practical value. For 
example, an engineer may determine that a modification to an automobile fuel injection 
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system may produce a true mean improvement in gasoline mileage of 0.1 mi/gal. This 
is a statistically significant result. However, if the cost of the modification is $1000, the 
0.1 mi/gal difference is probably too small to be of any practical value. 

4. Experiments are usually iterative. Remember that in most situations it is unwise to 
design too comprehensive an experiment at the start of a study. Successful design requires 
knowledge of the important factors, the ranges over which these factors are varied, the 
appropriate number of levels for each factor, and the proper methods and units of mea¬ 
surement for each factor and response. Generally, we are not well-equipped to answer 
these questions at the beginning of the experiment, but we learn the answers as we go 
along. This argues in favor of the iterative, or sequential, approach discussed previously. 
Of course, there are situations where comprehensive experiments are entirely appropriate, 
but as a general rule, most experiments should be iterative. Consequently, we usually 
should not invest more than about 25 percent of the resources of experimentation (runs, 
budget, time, etc.) in the initial experiment. Often these first efforts are just learning 
experiences, and some resources must be available to accomplish the final objectives of 
the experiment. 



Simple Comparative 
Experiments _ 



In this chapter, we consider experiments to compare two conditions (sometimes called 
treatments). These are often called simple comparative experiments. We begin with 
an example of an experiment performed to determine whether two different formulations 
of a product give equivalent results. The discussion leads to a review of several basic 
statistical concepts, such as random variables, probability distributions, random samples, 
sampling distributions, and tests of hypotheses. 


2-1 INTRODUCTION 

The tension bond strength of portland cement mortar is an important characteristic of the 
product. An engineer is interested in comparing the strength of a modified formulation 
in which polymer latex emulsions have been added during mixing to the strength of the 
unmodified mortar. The experimenter has collected 10 observations on strength for 
the modified formulation and another 10 observations for the unmodified formulation. 
The data are shown in Table 2-1 on the next page. We could refer to the two different 
formulations as two treatments or as two levels of the factor formulations. 

The data from this experiment are plotted in Figure 2-1 on the next page. This display 
is called a dot diagram. Visual examination of these data give the immediate impression 
that the strength of the unmodified mortar is greater than the strength of the modified 
mortar. This impression is supported by comparing the average tension bond strengths, 
yj = 16.76 kgf/cm 2 for the modified mortar andy 2 = 17.92 kgf/cm 2 for the unmodified 
mortar. The average tension bond strengths in these two samples differ by what seems 
to be a nontrivial amount. However, it is not obvious that this difference is large enough 
to imply that the two formulations really are different. Perhaps this observed difference 
in average strengths is the result of sampling fluctuation and the two formulations are 
really identical. Possibly another two samples would give opposite results, with the 
strength of the modified mortar exceeding that of the unmodified formulation. 

A technique of statistical inference called hypothesis testing (some prefer signifi¬ 
cance testing) can be used to assist the experimenter in comparing these two formula- 
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Table 2-1 Tension Bond Strength Data 
for the Portland Cement 
Formulation Experiment 
Modified Unmodified 

Mortar Mortar 

i_ yji _ yy 

1 16.85 17.50 

2 16.40 17.63 

3 17.21 18.25 

4 16.35 18.00 

5 16.52 17.86 

6 17.04 17.75 

7 16.96 18.22 

B 17.15 17.90 

9 16.59 17.96 

10 16.57 18.15 


tions. Hypothesis testing allows the comparison of the two formulations to be made on 
objective terms, with a knowledge of the risks associated with reaching the wrong con¬ 
clusion. Before presenting procedures for hypothesis testing in simple comparative ex¬ 
periments, we will briefly summarize some elementary statistical concepts. 


2-2 BASIC STATISTICAL CONCEPTS 

Each of the observations in the portland cement experiment described above would be 
called a run. Notice that the individual runs differ, so there is fluctuation, or noise, in 
the results. This noise is usually called experimental error or simply error. It is a 
statistical error, meaning that it arises from variation that is uncontrolled and generally 
unavoidable. The presence of error or noise implies that the response variable, tension 
bond strength, is a random variable. A random variable may be either discrete or 
continuous. If the set of all possible values of the random variable is either finite or 
countably infinite, then the random variable is discrete, whereas if the set of all possible 
values of the random variable is an interval, then the random variable is continuous. 

Graphical Description of Variability 

We often use simple graphical methods to assist in analyzing the data from an experiment. 
The dot diagram, illustrated in Figure 2-1, is a very useful device for displaying a small 
body of data (say up to about 20 observations). The dot diagram enables the experimenter 
to see quickly the general location or central tendency of the observations and their 
spread. For example, in the portland cement tension bond experiment, the dot diagram 


I_ 1 • •!••• OOP 0031) OOP _I_I 

15 16 a 17 i18 19 20 

Strength 

(kgf/cm 2 ) • = Modified mortar 

yi = 16.76 y 2 = 17.92 

Figure 2-1 Dot diagram for the tension bond strength data in Table 2-1. 


= Unmodified mortar 
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Figure 2-2 Histogram for 200 observations on metal recovery (yield) from a smelting process. 


reveals that the two formulations probably differ in mean strength but that both formu¬ 
lations produce about the same variation in strength. 

If the data are fairly numerous, the dots in a dot diagram become difficult to distin¬ 
guish, and a histogram may be preferable. Figure 2-2 presents a histogram for 200 ob¬ 
servations on the metal recovery, or yield, from a smelting process. The histogram shows 
the central tendency, spread, and general shape of the distribution of the data. Recall that 
a histogram is constructed by dividing the horizontal axis into intervals (usually of equal 
length) and drawing a rectangle over the 7 th interval with the area of the rectangle pro¬ 
portional to tij, the number of observations that fall in that interval. 

The box plot (or box and whisker plot) is a very useful way to display data. A box 
plot displays the minimum, the maximum, the lower and upper quartiles (the 25th per¬ 
centile and the 75th percentile, respectively), and the median (the 50th percentile) on a 
rectangular box aligned either horizontally or vertically. The box extends from the lower 
quartile to the upper quartile, and a line is drawn through the box at the median. Lines 
(or whiskers) extend from the ends of the box to (typically) the minimum and maximum 
values. [There are several variations of box plots that have different rules for denoting 
the extreme sample points. See Montgomery and Runger (1999) for more details.] 

Figure 2-3 (page 24) presents the box plots for the two samples of tension bond 
strength in the portland cement mortar experiment. This display clearly reveals the dif¬ 
ference in mean strength between the two formulations. It also indicates that both for¬ 
mulations produce reasonably symmetric distributions of strength with similar variability 
or spread. 

Dot diagrams, histograms, and box plots are useful for summarizing the information 
in a sample of data. To describe the observations that might occur in a sample more 
completely, we use the concept of the probability distribution. 

Probability Distributions 

The probability structure of a random variable, say y, is described by its probability 
distribution. If y is discrete, we often call the probability distribution of y, say piy), the 
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Modified Unmodified 

Mortar formulation 


Figure 2-3 Box plots for the portland cement tension bond strength 
experiment. 


probability function of y. If y is continuous, the probability distribution of y, say /(y), is 
often called the probability density function for y. 

Figure 2-4 illustrates hypothetical discrete and continuous probability distributions. 
Notice that in the discrete probability distribution, it is the height of the function p(yj) 
that represents probability, whereas in the continuous case, it is the area under the curve 




Figure 2-4 Discrete and continuous probability distributions. 
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/(y) associated with a given interval that represents probability. The properties of prob¬ 
ability distributions may be summarized quantitatively as follows: 

y discrete: 0 «£ p(y ; ) =£ \ all values of y, 

P(y = yj) = piy/} all values of y y 

^Ses m = 1 

of jy, 

y continuous: 0 =£ /(y) 

P(a^y « fe) = f /(y) dy 

f_J(y)dy = 1 


Mean, Variance, and Expected Values 

The mean, p, of a probability distribution is a measure of its central tendency or location. 
Mathematically, we define the mean as 

I f a yf(y ) dy 

2) xpGO 

all 

We may also express the mean in terms of the expected value or the long-run average 
value of the random variable y as 

{ y/(y) dy y continuous 

( 2 - 2 ) 

2j yp(y) y discrete 

all > 


y continuous 
y discrete 


( 2 - 1 ) 


where E denotes the expected value operator. 

The variability or dispersion of a probability distribution can be measured by the 
variance, defined as 


(y ~ pff(y) dy 
2 Cy - pfpiy) 


y continuous 
y discrete 


(2-3) 


Note that the variance can be expressed entirely in terms of expectation because 

<r 2 = E[(y - pi) 2 ] (2-4) 

Finally, the variance is used so extensively that it is convenient to define a variance 
operator V such that 


V(y) - E[(y - pl) 2 ] = a 1 


( 2 - 5 ) 
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The concepts of expected value and variance are used extensively throughout this 
book, and it may be helpful to review several elementary results concerning these op¬ 
erators. If y is a random variable with mean /i, and variance a 2 and c is a constant, then 

1. E(c) = c 

2 . £(y) = ft 

3. E(cy) = cE(y) = c/x 

4. V(c) = 0 

5. V(y) = a- 2 

6. V(cy) = c 2 V(y ) = c 2 cr 2 

If there are two random variables, for example, y, with £(y,) = /x, and VCy^ = 
a 2 and y 2 with £(y 2 ) = /r 2 and V(y 2 ) = a\, we have 

7. £(y, + y 2 ) = £(y,) + E(y 2 ) = ^ ^ 

It is possible to show that 

8. V(y } + y 2 ) = V(y.) + V(y 2 ) + 2 Cov(y„ y 2 ) 
where 


Cov(y,, y 2 ) = E[(y v - ^)(y 2 - /x 2 )] (2-6) 

is the covariance of the random variables y! and y 2 . The covariance is a measure of the 
linear association between y, and y 2 . More specifically, we may show that if y^ and y 2 
are independent, 1 then Cov(yj, y 2 ) = 0. We may also show that 

9. V(y, - ?2> = V(y,) + V(y 2 ) - 2 Cov(y lf y 2 ) 


If y! and y 2 are independent, we have 

10. V(y t ± y 2 ) = VlyO + V(y 2 ) = a\ + <r\ 
and 

11 . £(y, • y 2 ) = £(y,) • £(y 2 ) = pr ih. 

However, note that, in general, 


12 . # 


g(yi) 


regardless of whether or not y! and y 2 are independent. 


2-3 SAMPLING AND SAMPLING DISTRIBUTIONS 


Random Samples, Sample Mean, and Sample Variance 

The objective of statistical inference is to draw conclusions about a population using a 
sample from that population. Most of the methods that we will study assume that random 


1 Note that the converse of this is not necessarily so; that is, we may have Cov(y,, y 2 ) = 0 and yet this does not 
imply independence. For an example, see Hines and Montgomery (1990, pp. 128-129). 
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samples are used. That is, if the population contains N elements and a sample of n of 
them is to be selected, and if each of the Nl/(N — n)\n\ possible samples has an equal 
probability of being chosen, then the procedure employed is called random sampling. 
In practice, it is sometimes difficult to obtain random samples, and tables of random 
numbers, such as Table XI in the Appendix, may be helpful. 

Statistical inference makes considerable use of quantities computed from the obser¬ 
vations in the sample. We define a statistic as any function of the observations in a 
sample that does not contain unknown parameters. For example, suppose that y,,y 2 > • • • > 
y„ represents a sample. Then the sample mean 


i* 

— (2-7) 


and the sample variance 


i (y t ~ f) 1 2 

S 2 = — --- (2-8) 

n - 1 

are both statistics. These quantities are measures of the central tendency and dispersion 
of the sample, respectively. Sometimes S = Vs 2 , called the sample standard devia¬ 
tion, is used as a measure of dispersion. Engineers often prefer to use the standard de¬ 
viation to measure dispersion because its units are the same as those for the variable of 
interest y. 


Properties of the Sample Mean and Variance 

The sample mean y is a point estimator of the population mean /i, and the sample variance 
S 2 is a point estimator of the population variance a 2 . In general, an estimator of an 
unknown parameter is a statistic that corresponds to that parameter. Note that a point 
estimator is a random variable. A particular numerical value of an estimator, computed 
from sample data, is called an estimate. For example, suppose we wish to estimate the 
mean and variance of the breaking strength of a particular type of textile fiber. A random 
sample of n = 25 fiber specimens is tested and the breaking strength is recorded for each. 
The sample mean and variance are computed according to Equations 2-7 and 2-8, re¬ 
spectively, and are y = 18.6 and S 2 = 1.20. Therefore, the estimate of /r, is y = 18.6 and 
the estimate of a 2 is S 2 = 1.20. 

There are several properties required of good point estimators. Two of the most 
important are the following: 

1. The point estimator should be unbiased. That is, the long-run average or 
expected value of the point estimator should be the parameter that is being 
estimated. Although unbiasedness is desirable, this property alone does not 
always make an estimator a good one. 

2. An unbiased estimator should have minimum variance. This property states 
that the minimum variance point estimator has a variance that is smaller than 
the variance of any other estimator of that parameter. 
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We may easily show thaty and S 2 are unbiased estimators of /jl and a 2 , respectively. 
First consider y. Using the properties of expectation, we have 



= /r 


because the expected value of each observation y, is /jl. Thus, y is an unbiased estimator 
of fl. 

Now consider the sample variance S 2 . We have 


2 (* - y) 2 

-rW-J 

= ^ (y t ~ , 2 } 

— ~r e(ss) 

n - 1 


where SS = 2" =1 (y, - y) 2 is the corrected sum of squares of the observations y,. Now 
E(SS) = (y, - VI 2 j (2-9) 

= e [| yf - »> 2 ] 

= 2 (M 2 + <r 2 ) ~ n(fi 2 + <r 2 /n) 

= (n - l)o- 2 (2-10) 

Therefore, 


and we see that S 2 is an unbiased estimator of a 2 . 
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Degrees of Freedom 

The quantity n — 1 in Equation 2-10 is called the number of degrees of freedom of the 
sum of squares SS. This is a very general result; that is, if y is a random variable with 
variance a 2 and SS = 2(y, - y) 2 has v degrees of freedom, then 



The number of degrees of freedom of a sum of squares is equal to the number of inde¬ 
pendent elements in that sum of squares. For example, SS = 2"=i(y, — y) 2 in Equation 
2-9 consists of the sum of squares of the n elements y, ~ y,y 2 ~ y, ■ ■ ■, y n ~ >'■ These 
elements are not all independent because 2” =1 (y, - y) = 0; in fact, only n — 1 of them 
are independent, implying that SS has n — 1 degrees of freedom. 

The Normal and Other Sampling Distributions 

Often we are able to determine the probability distribution of a particular statistic if we 
know the probability distribution of the population from which the sample was drawn. 
The probability distribution of a statistic is called a sampling distribution. We will now 
briefly discuss several useful sampling distributions. 

One of the most important sampling distributions is the normal distribution. If y is 
a normal random variable, the probability distribution of y is 

f(y) = —L= e -«™y-rtM 2 -oo < y < oo (2-12) 

J w oS/Itt 

where - < ya < 00 is the mean of the distribution and cr 2 > 0 is the variance. The 

normal distribution is shown in Figure 2-5. 

Because sample runs that differ as a result of experimental error often are well 
described by the normal distribution, the normal plays a central role in the analysis of 
data from designed experiments. Many important sampling distributions may also be 
defined in terms of normal random variables. We often use the notation y ~ N(fi, a 2 ) to 
denote that y is distributed normally with mean /x and variance cr 2 . 

An important special case of the normal distribution is the standard normal dis¬ 
tribution; that is, n = 0 and cr 2 = 1. We see that if y ~ cr 2 ), the random variable 



Figure 2-5 The normal distribution. 


(2-13) 
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follows the standard normal distribution, denoted z ~ N( 0, 1). The operation demon¬ 
strated in Equation 2-13 is often called standardizing the normal random variable y. The 
cumulative standard normal distribution is given in Table I of the Appendix. 

Many statistical techniques assume that the random variable is normally distributed. 
The central limit theorem is often a justification of approximate normality. 

THEOREM 2-1 . 

The Central Limit Theorem 

If y\> y- 1 , ■ ■ ■, y n is a sequence of n independent and identically distributed random vari¬ 
ables with £(y,) = and V(y,) = a 2 (both finite) and x = y, + y 2 + ■ ■ ■ + y„, then 



has an approximate N( 0, 1) distribution in the sense that, if F n (z) is the distribution 
function of z„ and <l>(z) is the distribution function of the N( 0, 1) random variable, then 
lim n ^[F n (z)/<t>(z)] - 1. 


This result states essentially that the sum of n independent and identically distributed 
random variables is approximately normally distributed. In many cases this approxima¬ 
tion is good for very small n, say n < 10, whereas in other cases large n is required, say 
n > 100. Frequently, we think of the error in an experiment as arising in an additive 
manner from several independent sources; consequently, the normal distribution becomes 
a plausible model for the combined experimental error. 

An important sampling distribution that can be defined in terms of normal random 
variables is the chi-square or \ z distribution. If z u z 2 ,..., z k are normally and 
independently distributed random variables with mean 0 and variance 1, abbreviated 
NID(0, 1), then the random variable 

x = Zi + z§ + • • • + zf 

follows the chi-square distribution with k degrees of freedom. The density function of 
chi-square is 


f{x) = - 1 — r ( ‘ /2) “ 1 e“ ,/2 x > 0 (2-14) 

2 “ r 0) 

Several chi-square distributions are shown in Figure 2-6. The distribution is asym¬ 
metric, or skewed, with mean and variance 


/x = k 


a 2 = 2k 


respectively. Percentage points of the chi-square distribution are given in Table III of the 
Appendix. 
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As an example of a random variable that follows the chi-square distribution, suppose 
that y u y 2 , ..., y n is a random sample from an N(/x, a 2 ) distribution. Then 


2 O’, - y) 2 


(2-15) 


That is, SSI a 2 is distributed as chi-square with n — 1 degrees of freedom. 

Many of the techniques used in this book involve the computation and manipulation 
of sums of squares. The result given in Equation 2-15 is extremely important and occurs 
repeatedly; a sum of squares in normal random variables when divided by cr 2 follows 
the chi-square distribution. 

Examining Equation 2-8, we see that the sample variance can be written as 

S 2 = (2-16) 

n — 1 

If the observations in the sample are NID(/t, a 2 ), then the distribution of S 2 is 
[a 2 /(n - l)]Xn-i- Thus, the sampling distribution of the sample variance is a constant 
times the chi-square distribution if the population is normally distributed. 

If z and xl are independent standard normal and chi-square random variables, re¬ 
spectively, the random variable 


h 


VxtJk 


(2-17) 


follows the t distribution with k degrees of freedom, denoted t k . The density function 
of t is 


m + D/2] i 

m VtnT(k/2) [( t 2 !k) + If + 0/2 


(2-18) 


and the mean and variance of t are /t = 0 and a 2 = k/(k - 2) for k > 2, respectively. 
Several t distributions are shown in Figure 2-7 on page 32. Note that if k = 00 , the t 
distribution becomes the standard normal distribution. The percentage points of the t 
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distribution are given in Table II of the Appendix. If y u y 2 . y„ is a random sample 

from the N(fx, cr 2 ) distribution, then the quantity 


t = y^ 

S/Vn 


(2-19) 


is distributed as t with n - 1 degrees of freedom. 

The final sampling distribution that we will consider is the F distribution. If xt and 
Xu are two independent chi-square random variables with u and v degrees of freedom, 
respectively, then the ratio 


. X 2 Ju 
Xv/v 


( 2 - 20 ) 


follows the F distribution with u numerator degrees of freedom and v denominator 
degrees of freedom. If x is an F random variable with u numerator and v denominator 
degrees of freedom, then the probability distribution of x is 


( 2 - 21 ) 



Figure 2-8 Several F distributions. 
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Several F distributions are shown in Figure 2-8 on the preceding page. This distribution 
is very important in the statistical analysis of designed experiments. Percentage points 
of the F distribution are given in Table IV of the Appendix. 

As an example of a statistic that is distributed as F, suppose we have two independent 
normal populations with common variance a 1 . If y u , y 12 , . • •, yi„,, is a random sample 
of n t observations from the first population, and if y 2 i» y22> • • •» y-in 2 is a random sample 
of n 2 observations from the second, then 

§ - p- 22 > 

0 2 

where Sf and S\ are the two sample variances. This result follows directly from Equations 
2-15 and 2-20. 


2-4 INFERENCES ABOUT THE DIFFERENCES 
IN MEANS, RANDOMIZED DESIGNS 

We are now ready to return to the portland cement mortar problem posed in Section 2-1. 
Recall that two different formulations of mortar were being investigated to determine if 
they differ in tension bond strength. In this section we discuss how the data from this 
simple comparative experiment can be analyzed using hypothesis testing and confidence 
interval procedures for comparing two treatment means. 

Throughout this section we assume that a completely randomized experimental 
design is used. In such a design, the data are viewed as if they were a random sample 
from a normal distribution. 

2-4.1 Hypothesis Testing 

We now reconsider the portland cement experiment introduced in Section 2-1. Recall 
that we are interested in comparing the strength of two different formulations: an un¬ 
modified mortar and a modified mortar. In general, we can think of these two formulations 
as two levels of the factor “formulations.” Let yu, yi 2 ,..., yi„, represent the n { ob¬ 
servations from the first factor level and y 21 , y 22 ,..., y 2 „ 2 represent the n 2 observations 
from the second factor level. We assume that the samples are drawn at random from two 
independent normal populations. Figure 2-9 illustrates the situation. 



Sample 1:y Mf Sample 2: y 2V y 22 . y 2ni 

Factor level 1 Factor level 2 

Figure 2-9 The sampling situation for the two-sample t-test. 
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A Model for the Data 

We often describe the results of an experiment with a model. A simple statistical model 
that describes the data from an experiment such as we have just described is 

ytj = /*» + I \\ Ki (2-23) 

where y v is the jth observation from factor level i, /t, is the mean of the response at 
the /th factor level, and e, y is a normal random variable associated with the z'/th obser¬ 
vation. We assume that % are NID(0, of), i = 1, 2. It is customary to refer to e, y as the 
random error component of the model. Because the means /u,, and /x 2 are constants, we 
see directly from the model that y tJ are NID(/a,, of), i = 1,2, just as we previously 
assumed. For more information about models for the data, refer to the supplemental text 
material. 


Statistical Hypotheses 

A statistical hypothesis is a statement either about the parameters of a probability dis¬ 
tribution or the parameters of a model. The hypothesis reflects some conjecture about 
the problem situation. For example, in the portland cement experiment, we may think 
that the mean tension bond strengths of the two mortar formulations are equal. This may 
be stated formally as 


Ho-Pl = /A2 


where ^ is the mean tension bond strength of the modified mortar and /x 2 is the mean 
tension bond strength of the unmodified mortar. The statement H 0 : ^ ^ is called the 

null hypothesis and /fj: ju-i # ^ is called the alternative hypothesis. The alternative 
hypothesis specified here is called a two-sided alternative hypothesis because it would 
be true if fii < or if /u,, > /j^ 

To test a hypothesis we devise a procedure for taking a random sample, computing 
an appropriate test statistic, and then rejecting or failing to reject the null hypothesis H 0 . 
Part of this procedure is specifying the set of values for the test statistic that leads to 
rejection of H 0 . This set of values is called the critical region or rejection region for 
the test. 

Two kinds of errors may be committed when testing hypotheses. If the null hypoth¬ 
esis is rejected when it is true, a type I error has occurred. If the null hypothesis is not 
rejected when it is false, a type II error has been made. The probabilities of these two 
errors are given special symbols: 

a — P(type I error) = P(reject H 0 \H 0 is true) 

/3 = P(type II error) = P(fail to reject H 0 \H 0 is false) 

Sometimes it is more convenient to work with the power of the test, where 

Power = 1 - j8 = P(reject H 0 \H 0 is false) 

The general procedure in hypothesis testing is to specify a value of the probability of 
type I error a, often called the significance level of the test, and then design the test 
procedure so that the probability of type II error j8 has a suitably small value. 



2-4 INFERENCES ABOUT THE DIFFERENCES IN MEANS, RANDOMIZED DESIGNS 


35 


The Two-Sample t-Test 

Suppose that we could assume that the variances of tension bond strengths were identical 
for both mortar formulations. Then the appropriate test statistic to use for comparing two 
treatment means in the completely randomized design is 



where y, and y 2 are the sample means, n, and n 2 are the sample sizes, S 2 is 
of the common variance a\ = a\ — a 2 computed from 

( n , - 1 )S? + («2 ~ 1) S\ 
p «i + n 2 — 2 


(2-24) 


estimate 


(2-25) 


and Sj and Sj are the two individual sample variances. To determine whether to reject 
H 0 :/x ] = (j , 2 , we would compare t 0 to the t distribution with n y + n 2 - 2 degrees of 
freedom. If 1 1 0 | > t a , 2 ,n,+n 2 - 2 , where t a/2 , ni+ „ 2 _ 2 is the upper a/2 percentage point of the 
t distribution with n t + n 2 - 2 degrees of freedom, we would reject H 0 and conclude 
that the mean strengths of the two formulations of portland cement mortar differ. This 
test procedure is usually called the two-sample t-test. 

This procedure may be justified as follows. If we are sampling from independent 
normal distributions, then the distribution of y, - y 2 is N[/i^ - p 2 , cr 2 (l/«, + l/n 2 )]. 
Thus, if a 2 were known, and if H 0 : ^ were true, the distribution of 



would be N(0, 1). However, in replacing a in Equation 2-26 by S p , the distribution of 
Z 0 changes from standard normal to t with n x + n 2 — 2 degrees of freedom. Now if H 0 
is true, t 0 in Equation 2-24 is distributed as 4,+« 2 -2 and, consequently, we would expect 
100(1 - a) percent of the values of t 0 to fall between -t a , 2 .„,+„ 2 - 2 and t a/2 ^ l+ „ 2 - 2 . A 
sample producing a value of t 0 outside these limits would be unusual if the null hypothesis 
were true and is evidence that H 0 should be rejected. Thus the t distribution with n, + 
n 2 - 2 degrees of freedom is the appropriate reference distribution for the test statistic 
t 0 . That is, it describes the behavior of t 0 when the null hypothesis is true. Note that a is 
the probability of type I error for the test. 

In some problems, one may wish to reject H 0 only if one mean is larger than the 
other. Thus, one would specify a one-sided alternative hypothesis > fi 2 and 

would reject H 0 only if t 0 > t a ,„,+„ 2 - 2 . If one wants to reject H 0 only if p., is less 
than /a 2 , then the alternative hypothesis is < /*->, and one would reject H 0 if 

to — t a „ i+ „ 2 - 2 . 

To illustrate the procedure, consider the portland cement data in Table 2-1. For these 
data, we find that 


Modified Mortar 

y x = 16.76 kgf/cm 2 
= 0.100 
a, =0.316 
n, = 10 


Unmodified Mortar 

y 2 = 17.92 kgf/cm 2 

51 = 0.061 

5 2 = 0.247 
n 2 = 10 
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Because the sample standard deviations are reasonably similar, it is not unreasonable to 
conclude that the population standard deviations (or variances) are equal. Therefore, we 
can use Equation 2-24 to test the hypotheses 

H 0 : Hi = Ha 
Hi'-Hi * fh. 

Furthermore, «i + n 2 — 2 = 10 + 10 — 2 = 18, and if we choose a = 0.05, then we 
would reject H 0 :hi = Hi if the numerical value of the test statistic t 0 > f 0 . 025ilg = 2.101, 
or if t 0 < — to.o 25 .i 8 = — 2.101. These boundaries of the critical region are shown on the 
reference distribution (t with 18 degrees of freedom) in Figure 2-10. 

Using Equation 2-25 we find that 

= (/»! - 1)S? + (n 2 - 1)S| 
p «i + n 2 ~ 2 

9(0.100) + 9(0.061) 

10 + 10-2 
= 0.081 
S p = 0.284 

and the test statistic is 



Because t Q = -9.13 < — to.025.i8 = -2.101, we would reject H 0 and conclude that the 
mean tension bond strengths of the two formulations of portland cement mortar are 
different. 



Figure 2-10 The t distribution with 18 degrees of freedom with the critical 

region ±f 00 25,i8 = ±2.101. 
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The Use of P-Values in Hypothesis Testing 

One way to report the results of a hypothesis test is to state that the null hypothesis was 
or was not rejected at a specified a-value or level of significance. For example, in the 
Portland cement mortar formulation above, we can say that H 0 : /r, = /x 2 was rejected at 
the 0.05 level of significance. This statement of conclusions is often inadequate because 
it gives the decision maker no idea about whether the computed value of the test statistic 
was just barely in the rejection region or whether it was very far into this region. Fur¬ 
thermore, stating the results this way imposes the predefined level of significance on 
other users of the information. This approach may be unsatisfactory because some de¬ 
cision makers might be uncomfortable with the risks implied by a = 0.05. 

To avoid these difficulties, the P-value approach has been adopted widely in prac¬ 
tice. The P-value is the probability that the test statistic will take on a value that is at 
least as extreme as the observed value of the statistic when the null hypothesis H 0 is true. 
Thus, a P-value conveys much information about the weight of evidence against H 0 , and 
so a decision maker can draw a conclusion at any specified level of significance. More 
formally, we define the P-value as the smallest level of significance that would lead to 
rejection of the null hypothesis H 0 . 

It is customary to call the test statistic (and the data) significant when the null hy¬ 
pothesis H 0 is rejected; therefore, we may think of the P-value as the smallest level a at 
which the data are significant. Once the P-value is known, the decision maker can de¬ 
termine how significant the data are without the data analyst formally imposing a pre¬ 
selected level of significance. 

It is not always easy to compute the exact P-value for a test. However, most modem 
computer programs for statistical analysis report P-values, and they can be obtained on 
some handheld calculators. We will show how to approximate the P-value for the port- 
land cement mortar experiment. From Appendix Table II, for a t distribution with 18 
degrees of freedom, the smallest tail area probability is 0.0005, for which t omo5Jg = 
3.922. Now | f 0 | = 9.13 > 3.922, so because the alternative hypothesis is two-sided, we 
know that the P-value must be less than 2(0.0005) = 0.001. Some handheld calculators 
have the capability to calculate P-values. One such calculator is the HP-48. From this 
calculator, we obtain the P-value for the value t 0 = —9.13 in the portland cement mortar 
formulation experiment as P = 3.68 X 10 8 . Thus the null hypothesis H 0 :/x i = jx 2 
would be rejected at any level of significance a > 3.68 X 10 -8 . 

Computer Solution 

There are many statistical software packages that have capability for statistical hypothesis 
testing. The output from the Minitab two-sample t-test procedure applied to the portland 
cement mortar formulation experiment is shown in Table 2-2 on the next page. Notice 
that the output includes some summary statistics about the two samples (the abbreviation 
“SE mean” refers to the standard error of the mean, s/Vn ) as well as some information 
about confidence intervals on the difference in the two means (which we will discuss in 
Sections 2-4.3 and 2-6). The program also tests the hypothesis of interest, allowing the 
analyst to specify the nature of the alternative hypothesis (“not =” implies H x : ¥= 

/jl 2 ) and the choice of a (here a = 0.05). 

The output includes the computed value of t 0 , the P-value (called the significance 
level), and the decision that should be made given the specified value of a. Notice that 
the computed value of the t statistic differs slightly from our manually calculated value 
and that the P-value is reported to be P = 0.0000. Many software packages will not 
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Table 2-2 Minitab Two-Sample t-Test for the Portland Cement Mortar Experiment 
Two Sample f-Test and Confidence Interval 
Two sample T for Modified vs Unmod 


N Mean 
Modified 10 16.774 
Unmod 10 17.922 


StDev SE Mean 
0.309 0.098 
0.248 0.078 


95% Cl for mu Modified - mu Unmod: ( -1.411, -0.885) 

t-Test mu Modified = mu Unmod (vs not =): T = -9.16 
P = 0.0000 DF = 18 
Both use Pooled StDev = 0.280 


report an actual P-value less than 0.0001 and instead return a “default” value. That is 
the case here. 

Checking Assumptions in the t-Test 

In using the t-test procedure we make the assumptions that both samples are drawn from 
independent populations that can be described by a normal distribution, that the standard 
deviation or variances of both populations are equal, and that the observations are in¬ 
dependent random variables. The assumption of independence is critical, and if the run 
order is randomized (and, if appropriate, other experimental units and materials are se¬ 
lected at random), this assumption will usually be satisfied. The equal-variance and nor¬ 
mality assumptions are easy to check using a normal probability plot. 

Generally, probability plotting is a graphical technique for determining whether sam¬ 
ple data conform to a hypothesized distribution based on a subjective visual examination 
of the data. The general procedure is very simple and can be performed quickly with 
most statistics software packages. The supplemental text material discusses manual con¬ 
struction of normal probability plots. 

To construct a probability plot, the observations in the sample are first ranked from 

smallest to largest. That is, the sample y lt y 2 . y„ is arranged as y (1) , y (2) ,..., y (nj 

where y (1) is the smallest observation, y (2) is the second smallest observation, and so forth, 
with >'(„) the largest. The ordered observations y U) are then plotted against their observed 
cumulative frequency (j — 0.5)/«. The cumulative frequency scale has been arranged so 
that if the hypothesized distribution adequately describes the data, the plotted points will 
fall approximately along a straight line; if the plotted points deviate significantly from a 
straight line, the hypothesized model is not appropriate. Usually, the determination of 
whether or not the data plot as a straight line is subjective. 

To illustrate the procedure, suppose that we wish to check the assumption that tension 
bond strength in the portland cement mortar formulation experiment is normally distrib¬ 
uted. We initially consider only the observations from the unmodified mortar formulation. 
A computer-generated normal probability plot is shown in Figure 2-1 la. Most normal 
probability plots present 100(y — 0.5)/« on the left vertical scale (and occasionally 
100[1 - (j — 0.5)/«] is plotted on the right vertical scale), with the variable value plotted 
on the horizontal scale. Some computer-generated normal probability plots convert the 
cumulative frequency to a standard normal z score. A straight line, chosen subjectively, 
has been drawn through the plotted points. In drawing the straight line, you should be 
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Tension bond strength 
(6) Modified mortar 

Figure 2-11 Normal probability plots of tension bond strength in the portland 
cement experiment. 


influenced more by the points near the middle of the plot than by the extreme points. A 
good rule of thumb is to draw the line approximately between the 25th and 75th percentile 
points. This is how the line in Figure 2-1 la was determined. In assessing the ‘ ‘closeness” 
of the points to the straight line, imagine a fat pencil lying along the line. If all the points 
are covered by this imaginary pencil, a normal distribution adequately describes the data. 
Because the points in Figure 2-1 la would pass the fat pencil test, we conclude that the 
normal distribution is an appropriate model for tension bond strength for the unmodified 
mortar. Figure 2-116 presents the normal probability plot for the 10 observations on 
tension bond strength for the modified mortar. Once again, we would conclude that the 
assumption of a normal distribution is reasonable. 

We can obtain an estimate of the mean and standard deviation directly from the 
normal probability plot. The mean is estimated as the 50th percentile on the probability 
plot, and the standard deviation is estimated as the difference between the 84th and 50th 
percentiles. This means that we can verify the assumption of equal population variances 
in the portland cement experiment by simply comparing the slopes of the two straight 
lines in Figure 2-1 la and 2-116. Both lines have very similar slopes, and so the as- 
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sumption of equal variances is a reasonable one. If this assumption is violated, you should 
use the version of the Mest described in Section 2-4.4. The supplemental text material 
has more information about checking assumptions on the Mest. 

When assumptions are badly violated, the performance of the Mest will be affected. 
Generally, small to moderate violations of assumptions are not a major concern, but any 
failure of the independence assumption and strong indications of nonnormality should 
not be ignored. Both the significance level of the test and the ability to detect differences 
between the means will be adversely affected by departures from assumptions. Trans¬ 
formations are one approach to dealing with this problem. We will discuss this in more 
detail in Chapter 3. Nonparametric hypothesis testing procedures can also be used if the 
observations come from nonnormal populations. Refer to Montgomery and Runger 
(1999) for more details. 

An Alternate Justification to the t-Test 

The two-sample Mest we have just presented depends in theory on the underlying as¬ 
sumption that the two populations from which the samples were randomly selected are 
normal. Although the normality assumption is required to develop the test procedure 
formally, as we discussed above, moderate departures from normality will not seriously 
affect the results. It can be argued [e.g., see Box, Hunter, and Hunter (1978)] that the 
use of a randomized design enables one to test hypotheses without any assumptions 
regarding the form of the distribution. Briefly, the reasoning is as follows. If the treat¬ 
ments have no effect, all [20!/(10! 10!)] = 184,756 possible ways that the 20 observations 
could occur are equally likely. Corresponding to each of these 184,756 possible arrange¬ 
ments is a value of t 0 . If the value of t 0 actually obtained from the data is unusually large 
or unusually small with reference to the set of 184,756 possible values, it is an indication 
that j*L\ # ix 2 . 

This type of procedure is called a randomization test. It can be shown that the Mest 
is a good approximation of the randomization test. Thus, we will use Mests (and other 
procedures that can be regarded as approximations of randomization tests) without ex¬ 
tensive concern about the assumption of normality. This is one reason a simple procedure 
such as normal probability plotting is adequate to check the assumption of normality. 


2-4.2 Choice of Sample Size 

Selection of an appropriate sample size is one of the most important aspects of any 
experimental design problem. The choice of sample size and the probability of type II 
error /3 are closely connected. Suppose that we are testing the hypotheses 

//„:/*! = fL2 
+ ih. 

and that the means are not equal so that 8 = /r, — fi^. Because = fi 2 is not true, 

we are concerned about wrongly failing to reject H 0 . The probability of type II error 
depends on the true difference in means 8. A graph of )8 versus 8 for a particular sample 
size is called the operating characteristic curve, or O.C. curve for the test. The /3 error 
is also a function of sample size. Generally, for a given value of 8, the (3 error decreases 
as the sample size increases. That is, a specified difference in means is easier to detect 
for larger sample sizes than for smaller ones. 
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A set of operating characteristic curves for the hypotheses 
Hq-H 1 = /i 2 
A /is 

for the case where the two population variances trf and a\ are unknown but equal 
(ai = u\ = cr 2 ) and for a level of significance of a = 0.05 is shown in Figure 2-12. 
The curves also assume that the sample sizes from the two populations are equal; that 
is, «i = n 2 = n. The parameter on the horizontal axis in Figure 2-12 is 

j _ Ui - Mai _ |s| 

d — -*r~ 2 ^ 

Dividing | 5| by 2(r allows the experimenter to use the same set of curves, regardless of 
the value of the variance (the difference in means is expressed in standard deviation 
units). Furthermore, the sample size used to construct the curves is actually n* = 2n - 1. 
From examining these curves, we note the following: 

1. The greater the difference in means, Hi — 1 x 2 , the smaller the probability of type 
II error for a given sample size and a. That is, for a specified sample size and 
a, the test will detect large differences more easily than small ones. 

2. As the sample size gets larger, the probability of type II error gets smaller for a 
given difference in means and a. That is, to detect a specified difference 8, we 
may make the test more powerful by increasing the sample size. 

Operating characteristic curves are often helpful in selecting a sample size to use in 
an experiment. For example, consider the portland cement mortar problem discussed 
previously. Suppose that if the two formulations differ in mean strength by as much as 
0.5 kgf/cm 2 we would like to detect it with a high probability. Thus, because fx \ ~ ix 2 = 



Figure 2-12 Operating characteristic curves for the two-sided (-test with 
a = 0.05. (Reproduced with permission from “Operating Characteristics 
for the Common Statistical Tests of Significance,” C. L. Ferris, F. E. 
Gmbbs, and C. L. Weaver, Annals of Mathematical Statistics, June 1946.) 
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0.5 kgf/cm 2 is the “critical” difference in means we wish to detect, we find that d, the 
parameter on the horizontal axis of the operating characteristic curve in Figure 2-12, is 

d = 1 Mi ~ Pa\ = ^5 = 025 
2cr 2ar a 


Unfortunately, d involves the unknown parameter a. However, suppose that we think on 
the basis of prior experience that it is very unlikely that the standard deviation of any 
observation on strength would exceed 0.25 kgf/cm 2 . Then using a = 0.25 in the above 
expression for d yields d = 1. If we wish to reject the null hypothesis 95 percent of the 
time when /a, - yu, 2 = 0.5, then /3 = 0.05, and Figure 2-12 with /3 = 0.05 and d = 1 
yields n* = 16, approximately. Therefore, because n* = 2n — 1, the required sample 
size is 


n 


n* + 1 = 16 + 1 
2 “ 2 


8.5 


9 


and we would use sample sizes of rq = n 2 = n = 9. 

In our example, the experimenter actually used a sample size of 10. Perhaps the 
experimenter elected to increase the sample size slightly to guard against the possibility 
that the prior estimate of the common standard deviation cr was too conservative and 
was likely to be somewhat larger than 0.25. 

Operating characteristic curves often play an important role in the choice of sample 
size in experimental design problems. Their use in this respect is discussed in subsequent 
chapters. For a discussion of the uses of operating characteristic curves for other simple 
comparative experiments similar to the two-sample f-test see Montgomery and Runger 
(1999). 


2-4.3 Confidence Intervals 

Although hypothesis testing is a useful procedure, it sometimes does not tell the entire 
story. It is often preferable to provide an interval within which the value of the parameter 
or parameters in question would be expected to lie. These interval statements are called 
confidence intervals. In many engineering and industrial experiments, the experimenter 
already knows that the means fi, and differ; consequently, hypothesis testing on 
j± [ = fx 2 is of little interest. The experimenter would usually be more interested in a 
confidence interval on the difference in means /x, — y^. 

To define a confidence interval, suppose that 0 is an unknown parameter. To obtain 
an interval estimate of 0, we need to find two statistics L and U such that the probability 
statement 

P{L =£ 0 ^ U) = 1 - a (2-27) 

is true. The interval 

L ^ 0^U (2-28) 

is called a 100(1 — a) percent confidence interval for the parameter 0. The interpre¬ 
tation of this interval is that if, in repeated random samplings, a large number of such 
intervals are constructed, 100(1 - a) percent of them will contain the true value of 0. 
The statistics L and U are called the lower and upper confidence limits, respectively, 
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and 1 - a is called the confidence coefficient. If a = 0.05, Equation 2-28 is called a 
95 percent confidence interval for 6. Note that confidence intervals have a frequency 
interpretation; that is, we do not know if the statement is true for this specific sample, 
but we do know that the method used to produce the confidence interval yields correct 
statements 100(1 - a) percent of the time. 

Suppose that we wish to find a 100(1 - a) percent confidence interval on the true 
difference in means /jl, — [jl 2 for the portland cement problem. The interval can be derived 
in the following way. The statistic 


yi ~ yz - Oi - Ma) 



is distributed as t„ l+ „ 2 - 2 . Thus, 



Comparing Equations 2-29 and 2-27, we see that 



is a 100(1 - a) percent confidence interval for /Xi — /x 2 . 

The actual 95 percent confidence interval estimate for the difference in mean tension 
bond strength for the formulations of portland cement mortar is found by substituting in 
Equation 2-30 as follows: 

16.76 - 17.92 - (2.101)0.284^ ^ 

« 16.76 - 17.92 + (2.101)0.284Vto + To 
-1.16 - 0.27 ^ fit ~ 1 12 « -1.16 + 0.27 
-1.43 «£ /j-i /Aj —0.89 

Thus, the 95 percent confidence interval estimate on the difference in means extends 
from -1.43 kgf/cm 2 to -0.89 kgf/cm 2 . Put another way, the confidence interval is 
!±, - /jl 2 = -1.16 kgf/cm 2 ± 0.27 kgf/cm 2 , or the difference in mean strengths is 
-1.16 kgf/cm 2 , and the accuracy of this estimate is ±0.27 kgf/cm 2 . Note that because 
/A, fx 2 ~ 0 is not included in this interval, the data do not support the hypothesis that 
= i± 2 at the 5 percent level of significance. It is likely that the mean strength of the 
unmodified formulation exceeds the mean strength of the modified formulation. Notice 
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from Table 2-2 that Minitab also reported this confidence interval when the hypothesis 
testing procedure was conducted. 


2-4.4 The Case Where a\ # o| 


If we are testing 

H 0 -fi i = 

Hi'. iii + fJh 

and cannot reasonably assume that the variances a 2 and or are equal, then the two- 
sample f-test must be modified slightly. The test statistic becomes 


to = 



(2-31) 


This statistic is not distributed exactly as t. However, the distribution of t 0 is well- 
approximated by t if we use 



( Sl/ni) 2 | (Sl/n 2 ) 2 
«i — 1 n 2 — 1 


(2-32) 


as the degrees of freedom. A strong indication of unequal variances on a normal prob¬ 
ability plot would be a situation calling for this version of the t-test. You should be able 
to develop an equation for finding that confidence interval on the difference in mean for 
the unequal variances case easily. 


2-4.5 The Case Where cr\ and erf Are Known 

If the variances of both populations are known , then the hypotheses 
H 0 :/i! = fia 

Hl'.fl ! * fl2 


may be tested using the statistic 


Zo = 



(2-33) 


If both populations are normal, or if the sample sizes are large enough so that the central 
limit theorem applies, the distribution of Z 0 is 7V(0,1) if the null hypothesis is true. Thus, 
the critical region would be found using the normal distribution rather than the t. Spe¬ 
cifically, we would reject H 0 if | Z 0 1 > Z a/2 , where Z a/2 is the upper a/2 percentage point 
of the standard normal distribution. 
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Unlike the Mest of the previous sections, the test on means with known variances 
does not require the assumption of sampling from normal populations. One can use the 
central limit theorem to justify an approximate normal distribution for the difference in 
sample means y x — y 2 . 

The 100(1 — a) percent confidence interval on fx x — /x 2 where the variances are 
known is 

/tr? tr? /cr? cr? 

yi - y 2 ~ Z a/2 — + — =£ jLti - /X 2 ^ yi - y 2 + Z a/2 — + — (2-34) 

V "l «2 V "i ”2 

As noted previously, the confidence interval is often a useful supplement to the hypothesis 
testing procedure. 


2*4.6 Comparing a Single Mean to a Specified Value 

Some experiments involve comparing only one population mean /x to a specified value, 
say, (jl 0 . The hypotheses are 

H 0 :/x = fi 0 

Hi .fi * jxq 

If the population is normal with known variance, or if the population is nonnormal but 
the sample size is large enough so that the central limit theorem applies, then the hy¬ 
pothesis may be tested using a direct application of the normal distribution. The test 
statistic is 


Zo 


y - /*o 
cr/Vn 


(2-35) 


If H 0 : /x = ii 0 is true, then the distribution of Z 0 is N( 0, 1). Therefore, the decision rule 
for H 0 : ix = fx 0 is to reject the null hypothesis if | Z 0 1 > Z at2 . The value of the mean /x 0 
specified in the null hypothesis is usually determined in one of three ways. It may result 
from past evidence, knowledge, or experimentation. It may be the result of some theory 
or model describing the situation under study. Finally, it may be the result of contractual 
specifications. 

The 100(1 — a) percent confidence interval on the true population mean is 

y ~ Z a/2 <r/Vrt «£ fx =£ y + Z a/ 2 cr/Vn (2-36) 


EXAMPLE 2-1 . 

A vendor submits lots of fabric to a textile manufacturer. The manufacturer wants to 
know if the lot average breaking strength exceeds 200 psi. If so, she wants to accept the 
lot. Past experience indicates that a reasonable value for the variance of breaking strength 
is 100(psi) 2 . The hypotheses to be tested are 

H 0 -.fx = 200 
H^.il > 200 

Note that this is a one-sided alternative hypothesis. Thus, we would accept the lot only 
if the null hypothesis H 0 :/x = 200 could be rejected (i.e., if Z 0 > Z a ). 
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Four specimens are randomly selected, and the average breaking strength observed 
is y = 214 psi. The value of the test statistic is 


_ y - /ip _ 214 - 200 
° ~ a/Vn ~ 10/V4 


2.80 


If a type I error of a = 0.05 is specified, we find Z a = Z 0 05 = 1.645 from Appendix 
Table I. Thus H 0 is rejected, and we conclude that the lot average breaking strength 
exceeds 200 psi. 


If the variance of the population is unknown, we must make the additional assump¬ 
tion that the population is normally distributed, although moderate departures from nor¬ 
mality will not seriously affect the results. 

To test H 0 :/x = /x 0 in the variance unknown case, the sample variance S 2 is used to 
estimate cr 2 . Replacing a with S in Equation 2-35, we have the test statistic 


. _ y ~ Vo 
° SlV~n 


(2-37) 


The null hypothesis H 0 :(jl = / a 0 would be rejected if 1 1 0 \ > t af%n - \> where t a/2 , n - 1 denotes 
the upper all percentage point of the t distribution with n — 1 degrees of freedom. The 
100(1 - a) percent confidence interval in this case is 

y - t a/2 ,n-iS/Vn *£ /A « y + t a /2,n-iS/Vn (2-38) 


2-4.7 Summary 

Tables 2-3 and 2-4 summarize the test procedures discussed above for sample means. 
Critical regions are shown for both two-sided and one-sided alternative hypotheses. 


Table 2-3 Tests on Means with Variance Known 


Hypothesis 

Test Statistic 

Criteria for Rejection 

H 0 -V = Vo 

H, :n + fio 


I Z<) | > Za /2 

H 0 -V = Vo 

H 1 :/i < fi 0 

7 _y ~ Vo 

Z ° ' cr/Vn 

Z 0 <-Z a 

H 0 . f Ji = Vo 
H,-V > Vo 


Zo>Z a 

H 0 :v i = V2 
H\-V i * Vi 


\z 0 \>z a/2 

HoVi = Vi 

Hi', fa < n? 

^ yi - yi 

z 0 < -z a 




Ho' Vi ~ V2 

Hi :fii > /x2 


z 0 > Z a 
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Table 2-4 Tests on Means of Normal Distributions, 
Variance Unknown 


Hypothesis 

Test Statistic 

Criteria for Rejection 

Ho', n = Ho 
Hi'-H * Ho 


Uol > /o/2,n— 1 

H 0 :h = Ho 
H 1 :h< Ho 

, y ~ Ho 

° S/V~n 

to < “fa.,,-1 

Hq'-h = Ho 
H 1 :h> Ho 


to > t a .n- 1 

if <r\ = of 

H 0 -Hi = H2 

Hi'. Hi* H 2 

... yi ~ y-i 
s /Ul 

\ "l "2 
v = rh + n 2 ~ 2 

1 to | > ta/2,u 

if <t\ + 02 

t= j’ 

A II 

J J 

t yi-n 

0 Is] ! sl 

to < ~t a , v 


yjn , n 2 


H 0 : Hi = M2 

(Si + S i) 2 

to > tot.u 

H x :h\ > Hi 

\«1 "2/ 

v ~ {StKf | (Si/« 2 ) 2 



«! - 1 fl 2 1 



2-5 INFERENCES ABOUT THE DIFFERENCES IN MEANS, 

PAIRED COMPARISON DESIGNS 

2-5.1 The Paired Comparison Problem 

In some simple comparative experiments we can greatly improve the precision by making 
comparisons within matched pairs of experimental material. For example, consider a 
hardness testing machine that presses a rod with a pointed tip into a metal specimen with 
a known force. By measuring the depth of the depression caused by the tip, the hardness 
of the specimen is determined. Two different tips are available for this machine, and 
although the precision (variability) of the measurements made by the two tips seems to 
be the same, it is suspected that one tip produces different hardness readings than the 
other. 

An experiment could be performed as follows. A number of metal specimens (e.g., 
20) could be randomly selected. Half of these specimens could be tested by tip 1 and the 
other half by tip 2. The exact assignment of specimens to tips would be randomly de¬ 
termined. Because this is a completely randomized design, the average hardness of the 
two samples could be compared using the /-test described in Section 2-4. 

A little reflection will reveal a serious disadvantage in the completely randomized 
design for this problem. Suppose the metal specimens were cut from different bar stock 
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that were produced in different heats or that were not exactly homogeneous in some other 
way that might affect the hardness. This lack of homogeneity between specimens will 
contribute to the variability of the hardness measurements and will tend to inflate the 
experimental error, thus making a true difference between tips harder to detect. 

To protect against this possibility, consider an alternative experimental design. As¬ 
sume that each specimen is large enough so that two hardness determinations may be 
made on it. This alternative design would consist of dividing each specimen into two 
parts, then randomly assigning one tip to one-half of each specimen and the other tip to 
the remaining half. The order in which the tips are tested for a particular specimen would 
also be randomly selected. The experiment, when performed according to this design 
with 10 specimens, produced the (coded) data shown in Table 2-5. 

We may write a statistical model that describes the data from this experiment as 

I;*. 10 e-39) 

where y tj is the observation on hardness for tip i on specimen j, /x, is the true mean 
hardness of the ith tip, /S, is an effect on hardness due to the y'th specimen, and e, y is a 
random experimental error with mean zero and variance of. That is, of is the variance 
of the hardness measurements from tip 1 and of is the variance of the hardness mea¬ 
surements from tip 2. 

Note that if we compute the y'th paired difference 

dj = yij ~ y 2j j = 1, 2.10 (2-40) 

the expected value of this difference is 
fjL d = E(dj ) 

= E(y u - y 2 ,) 

= E(y tj ) - E(y 2 j) 

= l*i + 1 3; - (A*2 + Pj) 

= IX. X ~ Ho. 

That is, we may make inferences about the difference in the mean hardness readings of 
the two tips /X | — /x 2 by making inferences about the mean of the differences /u , d . Notice 

Table 2-5 Data for the Hardness 


Testing Experiment 


Specimen 

Tip 1 

Tip 2 

1 

7 

6 

2 

3 

3 

3 

3 

5 

4 

4 

3 

5 

8 

8 

6 

3 

2 

7 

2 

4 

8 

9 

9 

9 

5 

4 

10 

4 

5 
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that the additive effect of the specimens j 8 , cancels out when the observations are paired 
in this manner. 

Testing H 0 : is equivalent to testing 

Ho-Vd = 0 
0 


The test statistic for this hypothesis is 


d 

SjVn 


(2-41) 


where 


d = ~n%dj 

is the sample mean of the differences and 



1/2 


L n- 1 J 


L n - 1 J 


(2-42) 


(2-43) 


is the sample standard deviation of the differences. H 0 :y d = 0 would be rejected if 
\t 0 \ > t a / 2t n— i• Because the observations from the factor levels are “paired” on each 
experimental unit, this procedure is usually called the paired /-test. 

For the data in Table 2-5, we find 


d x = 7 — 6=1 
d 2 = 3-3 = 0 
d 3 = 3 - 5 = -2 
fl ? 4 = 4 - 3 = 1 
d 5 = 8-8 = 0 


d 6 = 3-2=1 
d n = 2 - 4 = -2 
d s = 9 - 9 = 0 
d 9 = 5-4=1 
d l0 = 4 — 5 = -1 


Thus, 



1/2 

ri3-^(-D 2 T 

L n - 1 J 

L io-i J 


Suppose we choose a = 0.05. Now to make a decision, we would compute t 0 and reject 
tfoifkol > to. 025,9 = 2.262. 

The computed value of the paired /-test statistic is 

- 

“ SJsTn 
_ - 0.10 
~~ 1.20/VlO 
= -0.26 
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<0 


Figure 2-13 The reference distribution (t with 9 degrees of freedom) for the 
hardness testing problem. 


and because 1 1 0 | = 0.26 > fo.025,9 = 2.262, we cannot reject the hypothesis H 0 : jx d = 0. 
That is, there is no evidence to indicate that the two tips produce different hardness 
readings. Figure 2-13 shows the t 0 distribution with 9 degrees of freedom, the reference 
distribution for this test, with the value of t 0 shown relative to the critical region. 

Table 2-6 shows the computer output from the Minitab paired f-test procedure for 
this problem. Notice that the P-value for this test is P — 0.80, implying that we cannot 
reject the null hypothesis at any reasonable level of significance. 


2*5.2 Advantages of the Paired Comparison Design 

The design actually used for this experiment is called the paired comparison design, 
and it illustrates the blocking principle discussed in Section 1-3. Actually, it is a special 
case of a more general type of design called the randomized block design. The term 
block refers to a relatively homogeneous experimental unit (in our case, the metal spec¬ 
imens are the blocks), and the block represents a restriction on complete randomization 
because the treatment combinations are only randomized within the block. We look at 
designs of this type in Chapter 4. In that chapter the mathematical model for the design, 
Equation 2-39, is written in a slightly different form. 


Table 2-6 Minitab Paired t-Test Results for the Hardness Testing Example 
Paired t-Test and Confidence Interval 

Paired T for Tip 1 - Tip 2 


N 

Tipi 10 
Tip 2 10 
Difference 10 


Mean StDev 

4.800 2.394 

4.900 2.234 

-0.100 1.197 


95% Cl for mean difference: (-0.956, 0.756) 
t-Test of mean difference = 0 (vs not = 0): 
T-Value = -0.26 P-Value = 0.798 


SE Mean 
0.757 
0.706 
0.379 
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Before leaving this experiment, several points should be made. Note that, although 
2 n = 2(10) = 20 observations have been taken, only n — 1=9 degrees of freedom are 
available for the t statistic. (We know that as the degrees of freedom for t increase the 
test becomes more sensitive.) By blocking or pairing, we have effectively “lost” n — 1 
degrees of freedom, but we hope we have gained a better knowledge of the situation by 
eliminating an additional source of variability (the difference between specimens). We 
may obtain an indication of the quality of information produced from the paired design 
by comparing the standard deviation of the differences S d with the pooled standard de¬ 
viation S p that would have resulted had the experiment been conducted in a completely 
randomized manner and the data of Table 2-5 been obtained. Using the data in Table 2- 
5 as two independent samples, we compute the pooled standard deviation from Equation 
2-25 to be S p = 2.32. Comparing this value to S d = 1.20, we see that blocking or pairing 
has reduced the estimate of variability by nearly 50 percent. We may also express this 
information in terms of a confidence interval on fJn - /x 2 . Using the paired data, a 95 
percent confidence interval on fi x — ^ is 

d ± to.025.9SdA/tt 

-0.10 ± (2.262)(1.20)/VlO 

- 0.10 ± 0.86 

Conversely, using the pooled or independent analysis, a 95 percent confidence interval 
on fii — 112 is 

yi ~ y2 — to.025,18 S p 

4.80 - 4.90 ± (2.101)(2.32)Vp5 + To 
-0.10 ± 2.18 

The confidence interval based on the paired analysis is much narrower than the confi¬ 
dence interval from the independent analysis. This illustrates the noise reduction prop¬ 
erty of blocking. 

Blocking is not always the best design strategy. If the within-block variability is the 
same as the between-block variability, the variance of y x — y 2 will be the same regardless 
of which design is used. Actually, blocking in this situation would be a poor choice of 
design because blocking results in the loss of n — 1 degrees of freedom and will actually 
lead to a wider confidence interval on /jl x — A further discussion of blocking is given 
in Chapter 4. 



2-6 INFERENCES ABOUT THE VARIANCES 
OF NORMAL DISTRIBUTIONS 

In many experiments, we are interested in possible differences in the mean response for 
two treatments. However, in some experiments it is the comparison of variability in the 
data that is important. In the food and beverage industry, for example, it is important 
that the variability of filling equipment be small so that all packages have close to the 
nominal net weight or volume of content. In che mi cal laboratories, we may wish to 
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compare the variability of two analytical methods. We now briefly examine tests of 
hypotheses and confidence intervals for variances of normal distributions. Unlike the 
tests on means, the procedures for tests on variances are rather sensitive to the normality 
assumption. A good discussion of the normality assumption is in Appendix 2A of Davies 
(1956). 

Suppose we wish to test the hypothesis that the variance of a normal population 
equals a constant, for example, al. Stated formally, we wish to test 

H 0 :a 2 = o-o 

Hi.a 2 =£ Oo (2-44) 


The test statistic for Equation 2-44 is 


, SS (n - 1)S 2 
Xo = ~2 = - ~2 - 


(2-45) 


where SS = 2” =1 (y, — y) 2 is the corrected sum of squares of the sample observations. 
The appropriate reference distribution for xl is the chi-square distribution with n — 1 
degrees of freedom. The null hypothesis is rejected if xl > x\a,n-\ or if xl < 
Xi-(u/2),n- 1 , where xl/2,n- 1 and Xi-(a/2),„- 1 are the upper a/2 and lower 1 - (a/2) 
percentage points of the chi-square distribution with n — 1 degrees of freedom, respec¬ 
tively. Table 2-7 (on the facing page) gives the critical regions for the one-sided alter¬ 
native hypotheses. The 100(1 — a) percent confidence interval on a 2 is 


(n - 1 )S 2 ^ 2 „ (n - 1)S 2 

-JO' —2 - 

Xa/2,n-l X 1—(a/2),n— 1 


(2-46) 


Now consider testing the equality of the variances of two normal populations. If 
independent random samples of size n x and n 2 are taken from populations 1 and 2, 
respectively, the test statistic for 

H 0 :a\ = oi 

H 1 :a 2 # oi (2-47) 

is the ratio of the sample variances 

Fo = | (2-48) 


The appropriate reference distribution for F 0 is the F distribution with n t - 1 numerator 
degrees of freedom and n 2 - 1 denominator degrees of freedom. The null hypothesis 
would be rejected if F 0 > F a/ or if F 0 < i 7 !_ (Q ./ 2 ).« ] — ii , where F a/Xni - u „ 2 -i 
and F, denote the upper a/2 and lower 1 - (a/2) percentage points of the 

F distribution with n x — 1 and n 2 — l degrees of freedom. Table IV of the Appendix 
gives only upper-tail percentage poins of F; however, the upper- and lower-tail points 
are related by 


Fi 




x,V 2 ,V t 


(2-49) 
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Table 2-7 Tests on Variances of Normal Distributions 


Hypothesis 

Test Statistic 

Criteria for Rejection 

Ho-.o 2 = oi 
//,: a 2 + o% 


Xl > Xl/ 2 .n 1 or 
*0 < X\-<xJ2,n— 1 

H 0 : a 2 = Oq 
H l :a 2 < Oq 

, in ~ DS 2 
* 0= 00 

xl < 1 

Ho-.o 2 = oi 
H l :o 2 >o 2 0 


xl > xl *-1 

Ho-.o 2 = <ri 
H x -.a\ ¥= ol 

0 si 

F 0 > F«/2, ni - or 
Fo < F\— a j2*i —1.«2— 1 

H 0 :oi = of 

Hi : ol < o| 

-I 

F 0 > 

H 0 : oj = 

Hi: d\ > of 


F 0 > Fa.^- 1^-1 


Test procedures for more than two variances are discussed in Chapter 3, Section 3-4.3. 
We will also discuss the use of the variance or standard deviation as a response variable 
in more general experimental settings. 

EXAMPLE 2-2 . 

A chemical engineer is investigating the inherent variability of two types of test equip¬ 
ment that can be used to monitor the output of a production process. He suspects that 
the old equipment, type 1, has a larger variance than the new one. Thus, he wishes to 
test the hypothesis 

H 0 \a\ = oi 
Hi’.a] > a\ 

Two random samples of «, = 12 and n 2 = 10 observations are taken, and the sample 
variances are 5? = 14.5 and Si = 10.8. The test statistic is 



From Appendix Table IV we find that F 0 0 5 ,i 1,9 = 3.10, so the null hypothesis cannot be 
rejected. That is, we have found insufficient statistical evidence to conclude that the 
variance of the old equipment is greater than the variance of the new equipment. 


The 100(1 - a) confidence interval for the ratio of the population variances or/cr 2 
is 



(2-50) 
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To illustrate the use of Equation 2-50, the 95 percent confidence interval for the ratio of 
variances a-\la\ in Example 2-2 is, using Fo.025,9,11 = 3.59 and fo.975,9,11 = l/fo.025,11,9 
= 1/3.92 = 0.255, 


0.34 « ^ 4.81 


2-7 PROBLEMS - 

2-1. The breaking strength of a fiber is required to be at least 150 psi. Past experience has 
indicated that the standard deviation of breaking strength is a = 3 psi. A random sample 
of four specimens is tested, and the results are ^ = 145, y 2 = 153, y 3 = 150, and y 4 = 
147. 

(a) State the hypotheses that you think should be tested in this experiment. 

(b) Test these hypotheses using a = 0.05. What are your conclusions? 

(c) Find the P-value for the test in part (b). 

(d) Construct a 95 percent confidence interval on the mean breaking strength. 

2-2. The viscosity of a liquid detergent is supposed to average 800 centistokes at 25°C. A 
random sample of 16 batches of detergent is collected, and the average viscosity is 812. 
Suppose we know that the standard deviation of viscosity is a = 25 centistokes. 

(a) State the hypotheses that should be tested. 

(b) Test these hypotheses using a = 0.05. What are your conclusions? 

(c) What is the P-value for the test? 

(d) Find a 95 percent confidence interval on the mean. 

2-3. The diameters of steel shafts produced by a certain manufacturing process should have a 
mean diameter of 0.255 inches. The diameter is known to have a standard deviation of 
cr = 0.0001 inch. A random sample of 10 shafts has an average diameter of 0.2545 inch. 

(a) Set up appropriate hypotheses on the mean /x. 

(b) Test these hypotheses using a = 0.05. What are your conclusions? 

(c) Find the P-value for this test. 

(d) Construct a 95 percent confidence interval on the mean shaft diameter. 

2-4. A normally distributed random variable has an unknown mean /x and a known variance 
a 2 = 9. Find the sample size required to construct a 95 percent confidence interval on 
the mean that has total width of 1.0. 

2-5. The shelf life of a carbonated beverage is of interest. Ten bottles are randomly selected 
and tested, and the following results are obtained: 


Days 

108 

138 

124 

163 

124 

159 

106 

134 

115 

139 


(a) We would like to demonstrate that the mean shelf life exceeds 120 days. Set up 
appropriate hypotheses for investigating this claim. 

(b) Test these hypotheses using a = 0.01. What are your conclusions? 
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(c) Find the P-value for the test in part (b). 

(d) Construct a 99 percent confidence interval on the mean shelf life. 

2-6. Consider the shelf life data in Problem 2-5. Can shelf life be described or modeled ade¬ 
quately by a normal distribution? What effect would violation of this assumption have on 
the test procedure you used in solving Problem 2-5? 

2-7. The time to repair an electronic instrument is a normally distributed random variable 
measured in hours. The repair times for 16 such instruments chosen at random are as 
follows: 


Hours 


159 280 101 
224 379 179 
222 362 168 
149 260 485 


212 

264 

250 

170 


(a) You wish to know if the mean repair time exceeds 225 hours. Set up appropriate 
hypotheses for investigating this issue. 

(b) Test the hypotheses you formulated in part (a). What are your conclusions? Use 
a = 0.05. 

(c) Find the P-value for the test. 

(d) Construct a 95 percent confidence interval on mean repair time. 

2-8. Reconsider the repair time data in Problem 2-7. Can repair time, in your opinion, be 
adequately modeled by a normal distribution? 

2-9. Two machines are used for filling plastic bottles with a net volume of 16.0 ounces. The 
filling processes can be assumed to be normal, with standard deviations of er t = 0.015 
and cr 2 = 0.018. The quality engineering department suspects that both machines fill to 
the same net volume, whether or not this volume is 16.0 ounces. An experiment is per¬ 
formed by taking a random sample from the output of each machine. 


Machine 1 Machine 2 


16.03 16.01 

16.04 15.96 

16.05 15.98 

16.05 16.02 

16.02 15.99 


16.02 16.03 

15.97 16.04 

15.96 16.02 

16.01 16.01 

15.99 16.00 


(a) State the hypotheses that should be tested in this experiment. 

(b) Test these hypotheses using a = 0.05. What are your conclusions? 

(c) Find the P-value for this test. 

(d) Find a 95 percent confidence interval on the difference in mean fill volume for the 
two machines. 

2-10. Two types of plastic are suitable for use by an electronic calculator manufacturer. The 
breaking strength of this plastic is important. It is known that a, = a 2 = 1.0 psi. From 
random samples of rii = 10 and n 2 = 12 we obtain y, = 162.5 and y 2 = 155.0. The 
company will not adopt plastic 1 unless its breaking strength exceeds that of plastic 2 by 
at least 10 psi. Based on the sample information, should they use plastic 1? In answering 
this question, set up and test appropriate hypotheses using a = 0.01. Construct a 99 percent 
confidence interval on the true mean difference in breaking strength. 
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2-11. The following are the burning times of chemical flares of two different formulations. The 
design engineers are interested in both the mean and variance of the burning times. 


Type 1 

Type 2 

65 

82 

64 

56 

81 

67 

71 

69 

57 

59 

83 

74 

66 

75 

59 

82 

82 

70 

65 

79 


(a) Test the hypothesis that the two variances are equal. Use a = 0.05. 

(b) Using the results of (a), test the hypothesis that the mean burning times are equal. 
Use a = 0.05. What is the P-value for this test? 

(c) Discuss the role of the normality assumption in this problem. Check the assumption 
of normality for both types of flares. 

2-12. An article in Solid State Technology, “Orthogonal Design for Process Optimization and 
Its Application to Plasma Etching” by G. Z. Yin and D. W. Jillie (May, 1987) describes 
an experiment to determine the effect of the C 2 F 6 flow rate on the uniformity of the etch 
on a silicon wafer used in integrated circuit manufacturing. Data for two flow rates are as 
follows: 





Uniformity Observation 



(SCCM) 

1 

2 

3 

4 

5 

6 

125 

2.7 

4.6 

2.6 

3.0 

3.2 

3.8 

200 

4.6 

3.4 

2.9 

3.5 

4.1 

5.1 


(a) Does the C 2 F 6 flow rate affect average etch uniformity? Use a = 0.05. 

(b) What is the P-value for the test in part (a)? 

(c) Does the C 2 F 6 flow rate affect the wafer-to-wafer variability in etch uniformity? Use 
a = 0.05. 

(d) Draw box plots to assist in the interpretation of the data from this experiment. 

2-13. A new filtering device is installed in a chemical unit. Before its installation, a random 

sample yielded the following information about the percentage of impurity: y, = 12.5, 
S? = 101.17, and n, = 8. After installation, a random sample yielded y 2 = 10.2, 
Si = 94.73, n 2 = 9. 

(a) Can you conclude that the two variances are equal? Use a = 0.05. 

(b) Has the filtering device reduced the percentage of impurity significantly? Use 
a = 0.05. 

2-14. Twenty observations on etch uniformity on silicon wafers are taken during a qualification 
experiment for a plasma etcher. The data are as follows: 

5.34 6.65 4.76 5.98 7.25 

6.00 7.55 5.54 5.62 6.21 

5.97 7.35 5.44 4.39 4.98 

5.25 6.35 4.61 6.00 5.32 

(a) Construct a 95 percent confidence interval estimate of o 2 . 

(b) Test the hypothesis that o 2 = 1.0. Use a = 0.05. What are your conclusions? 
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(c) Discuss the normality assumption and its role in this problem. 

(d) Check normality by constructing a normal probability plot. What are your 
conclusions? 

2-15. The diameter of a ball bearing was measured by 12 inspectors, each using two different 
kinds of calipers. The results were 


Inspector 

Caliper 1 

Caliper 2 

1 

0.265 

0.264 

2 

0.265 

0.265 

3 

0.266 

0.264 

4 

0.267 

0.266 

5 

0.267 

0.267 

6 

0.265 

0.268 

7 

0.267 

0.264 

8 

0.267 

0.265 

9 

0.265 

0.265 

10 

0.268 

0.267 

11 

0.268 

0.268 

12 

0.265 

0.269 


(a) Is there a significant difference between the means of the population of measurements 
from which the two samples were selected? Use a = 0.05. 

(b) Find the P-value for the test in part (a). 

(c) Construct a 95 percent confidence interval on the difference in mean diameter mea¬ 
surements for the two types of calipers. 

2-16. An article in the Journal of Strain Analysis (vol. 18, no. 2, 1983) compares several pro¬ 
cedures for predicting the shear strength for steel plate girders. Data for nine girders in 
the form of the ratio of predicted to observed load for two of these procedures, the Karls¬ 
ruhe and Lehigh methods, are as follows: 


Girder 

Karlsruhe Method 

Lehigh Method 

Sl/1 

1.186 

1.061 

S2/1 

1.151 

0.992 

S3/1 

1.322 

1.063 

S4/1 

1.339 

1.062 

S5/1 

1.200 

1.065 

S2/1 

1.402 

1.178 

S2/2 

1.365 

1.037 

S2/3 

1.537 

1.086 

S2/4 

1.559 

1.052 


(a) Is there any evidence to support a claim that there is a difference in mean perfor¬ 
mance between the two methods? Use a = 0.05. 

(b) What is the P-value for the test in part (a)? 

(c) Construct a 95 percent confidence interval for the difference in mean predicted to 
observed load. 

(d) Investigate the normality assumption for both samples. 

(e) Investigate the normality assumption for the difference in ratios for the two methods. 

(f) Discuss the role of the normality assumption in the paired f-test. 
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2-17. The deflection temperature under load for two different formulations of ABS plastic pipe 
is being studied. Two samples of 12 observations each are prepared using each formulation 
and the deflection temperatures (in °F) are reported below: 


Formulation 1 Formulation 2 


206 

188 

205 

187 


193 

207 

185 

189 


192 

210 

194 

178 


177 

197 

206 

201 


176 

185 

200 

197 


198 

188 

189 

203 


(a) Construct normal probability plots for both samples. Do these plots support as¬ 
sumptions of normality and equal variance for both samples? 

(b) Does the data support the claim that the mean deflection temperature under load for 
formulation 1 exceeds that of formulation 2? Use a = 0.05. 

(c) What is the F-value for the test in part (a)? 

2-18. Refer to the data in Problem 2-17. Do the data support a claim that the mean deflection 
temperature under load for formulation 1 exceeds that of formulation 2 by at least 3°F? 
2-19. In semiconductor manufacturing wet chemical etching is often used to remove silicon 
from the backs of wafers prior to metalization. The etch rate is an important characteristic 
of this process. Two different etching solutions are being evaluated. Eight randomly se¬ 
lected wafers have been etched in each solution and the observed etch rates (in mils/min) 
are shown below. 


Solution 1 Solution 2 


9.9 10.6 10.2 

9.4 10.3 10.0 

10.0 9.3 10.7 

10.3 9.8 10.5 


10.6 

10.2 

10.4 

10.3 


(a) Do the data indicate that the claim that both solutions have the same mean etch rate 
is valid? Use a = 0.05 and assume equal variances. 

(b) Find a 95 percent confidence interval on the difference in mean etch rates. 

(c) Use normal probability plots to investigate the adequacy of the assumptions of nor¬ 
mality and equal variances. 

2-20. Two popular pain medications are being compared on the basis of the speed of absorption 
by the body. Specifically, tablet 1 is claimed to be absorbed twice as fast as tablet 2. 
Assume that aj and Oj are known. Develop a test statistic for 

H 0 :2fi, = fi2 

Hx'.lfii ¥= fa 

2-21. Suppose we are testing 

Ho-fii = Ha 
H, :fi i * H2 

where a 2 , and crj are known. Our sampling resources are constrained such that 
n, + n 2 = N. How should we allocate the N observations between the two populations 
to obtain the most powerful test? 

2-22. Develop Equation 2-46 for a 100(1 — a) percent confidence interval for the variance of 
a normal distribution. 
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2-23. Develop Equation 2-50 for a 100(1 - a) percent confidence interval for the ratio oiler\, 
where o\ and o\ are the variances of two normal distributions. 

2-24. Develop an equation for finding a 100(1 - a) percent confidence interval on the difference 
in the means of two normal distributions where oj + o%. Apply your equation to the 
Portland cement experiment data, and find a 95 pecent confidence interval. 

2-25. Construct a data set for which the paired t-test statistic is very large, but for which the 
usual two-sample or pooled t-test statistic is small. In general, describe how you created 
the data. Does this give you any insight regarding how the paired f-test works? 




Experiments 
with a Single 
Factor: The Analysis 
of Variance 


In Chapter 2 we discussed methods for comparing two conditions or treatments. For 
example, the portland cement tension bond experiment involved two different mortar 
formulations. Another way to describe this experiment is as a single-factor experiment 
with two levels of the factor, where the factor is mortar formulation and the two levels 
are the two different formulation methods. Many experiments of this type involve more 
than two levels of the factor. In this chapter we present methods for the design and 
analysis of single-factor experiments with a levels of the factor (or a treatments). We 
will assume that the experiment has been completely randomized. 


3-1 AN EXAMPLE 

A product development engineer is interested in investigating the tensile strength of a 
new synthetic fiber that will be used to make cloth for men’s shirts. The engineer knows 
from previous experience that the strength is affected by the weight percent of cotton 
used in the blend of materials for the fiber. Furthermore, she suspects that increasing the 
cotton content will increase the strength, at least initially. She also knows that cotton 
content should range between about 10 and 40 percent if the final product is to have 
other quality characteristics that are desired (such as the ability to take a permanent-press 
finishing treatment). The engineer decides to test specimens at five levels of cotton weight 
percent: 15, 20, 25, 30, and 35 percent. She also decides to test five specimens at each 
level of cotton content. 

This is an example of a single-factor experiment with a = 5 levels of the factor and 
n- 5 replicates. The 25 runs should be made in random order. To illustrate how the 
run order may be randomized, suppose that we number the runs as follows: 

60 
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Cotton 

Weight 

Percentage 


Experimental Run Number 


15 

1 

2 

3 

4 

5 

20 

6 

7 

8 

9 

10 

25 

11 

12 

13 

14 

15 

30 

16 

17 

18 

19 

20 

35 

21 

22 

23 

24 

25 


Now we select a random number between 1 and 25. Suppose this number is 8. Then the 
number 8 observation (20 percent cotton) is run first. This process would be repeated 
until all 25 observations have been assigned a position in the test sequence. 1 Many 
computer software packages for assisting experimenters in selecting and constructing a 
design randomize the run order by using random numbers in this fashion. 

Suppose that the test sequence obtained is 


Test Sequence 

Run Number 

Cotton Weight Percentage 

1 

8 

20 

2 

18 

30 

3 

10 

20 

4 

23 

35 

5 

17 

30 

6 

5 

15 

7 

14 

25 

8 

6 

20 

9 

15 

25 

10 

20 

30 

11 

9 

20 

12 

4 

15 

13 

12 

25 

14 

7 

20 

15 

1 

15 

16 

24 

35 

17 

21 

35 

18 

11 

25 

19 

2 

15 

20 

13 

25 

21 

22 

35 

22 

16 

30 

23 

25 

35 

24 

19 

30 

25 

3 

15 


This randomized test sequence is necessary to prevent the effects of unknown nuisance 
variables, perhaps varying out of control during the experiment, from contaminating the 


1 The only restriction on randomization here is that if the same number (e.g., 8) is drawn again, it is discarded. This 
is a minor restriction and is ignored. 
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Table 3-1 Data (in lb/in 2 ) from the Tensile Strength Experiment 


Cotton 

Weight 

Percentage 


Observations 


Total 

Average 

1 

2 

3 

4 

5 

15 

7 

7 

15 

11 

9 

49 

9.8 

20 

12 

17 

12 

18 

18 

77 

15.4 

25 

14 

18 

18 

19 

19 

88 

17.6 

30 

19 

25 

22 

19 

23 

108 

21.6 

35 

7 

10 

11 

15 

11 

54 

10.8 







376 

15.04 


results. To illustrate, suppose that we were to run the 25 test specimens in the original 
nonrandomized order (that is, all five 15 percent cotton specimens are tested first, all five 
20 percent cotton specimens are tested next, and so on). If the tensile strength testing 
machine exhibits a warm-up effect such that the longer it is on, the lower the observed 
tensile strength readings will be, the warm-up effect will potentially contaminate the 
tensile strength data and destroy the validity of the experiment. 

Suppose that the engineer runs the test in the random order we have determined. 
The observations that she obtains on tensile strength are shown in Table 3-1. 

It is always a good idea to examine experimental data graphically. Figure 3-1 pre¬ 
sents box plots for tensile strength at each level of cotton weight percent, and Figure 3-2 
(on the facing page) is a scatter diagram of tensile strength versus cotton weight per¬ 
centage. In Figure 3-2, the solid dots are the individual observations and the open circles 
are the average observed tensile strengths. Both graphs indicate that tensile strength 
increases as cotton content increases, up to about 30 percent cotton. Beyond 30 percent 
cotton, there is a marked decrease in tensile strength. There is no strong evidence to 
suggest that the variability in tensile strength around the average depends on the cotton 
weight percentage. Based on this simple graphical analysis, we strongly suspect that (1) 
cotton content affects tensile strength and (2) around 30 percent cotton results in maxi¬ 
mum strength. 

Suppose that we wish to be more objective in our analysis of the data. Specifically, 



15 20 25 30 35 40 

Cotton weight percentage 


Figure 3-1 Box plots of tensile strength versus cotton weight 
percentage. 
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1 20 


15 20 25 30 35 

Cotton weight percent 

Figure 3-2 Scatter diagram of tensile strength versus 
cotton weight percentage. 


suppose that we wish to test for differences between the mean strengths at all a = 5 
levels of cotton weight percentage. Thus, we are interested in testing the equality of all 
five means. It might seem that this problem could be solved by performing a f-test for 
all the possible pairs of means. However, this is not the best solution to this problem 
because it would lead to considerable distortion in the type I error. For example, suppose 
we wish to test the equality of the five means using pairwise comparisons. There are 10 
possible pairs, and if the probability of correctly accepting the null hypothesis for each 
individual test is 1 - a = .95, the probability of correctly accepting the null hypothesis 
for all 10 tests is (.95) 10 = .60 if the tests are independent. Thus, a substantial increase 
in the type I error has occurred. 

The appropriate procedure for testing the equality of several means is the analysis 
of variance. However, the analysis of variance has a much wider application than the 
problem above. It is probably the most useful technique in the field of statistical inference. 


3-2 THE ANALYSIS OF VARIANCE 

Suppose we have a treatments or different levels of a single factor that we wish to 
compare. The observed response from each of the a treatments is a random variable. The 
data would appear as in Table 3-2. An entry in Table 3-2 (e.g., y,j) represents the jth 
observation taken under factor level or treatment i. There will be, in general, n obser- 


Table 3-2 

Typical Data for a Single-Factor Experiment 



Treatment 

(level) 


Observations 


Totals 

Averages 

1 

3n 

y ,2 

3i» 

3t. 

3i. 

2 

321 

322 

y 2n 

32. 

32. 

a 

y a , 

3*2 

y„n 

3a. 

3a. 





3.. 

3.. 
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vations under the ith treatment. Notice that Table 3-2 is the general case of the data from 
the tensile strength experiment in Table 3-1. 


Models for the Data 

We will find it useful to describe the observations from an experiment with a model. 
One way to write this model is 


y>j 


[i = 1,2,..., a 
Hi = 1. 2 ,.... n 


(3-1) 


where y 0 - is the yth observation, /a, is the mean of the ith factor level or treatment, and 
e, y is a random error component that incorporates all other sources of variability in the 
experiment including measurement, variability arising from uncontrolled factors, differ¬ 
ences between the experimental units (such as test material, etc.) to which the treatments 
are applied, and the general background noise in the process (such as variability over 
time, effects of environmental variables, and so forth). It is convenient to think of the 
errors as having mean zero, so that £(y,y) = /a,. 

Equation 3-1 is called the means model. An alternative way to write a model for 
the data is to define 


/a, = /a + t„ i = 1, 2 ,..., a 
so that Equation 3-1 becomes 

y u = /a + t,- + 

In this form of the model, /a is a parameter common to all treatments called the overall 
mean, and r, is a parameter unique to the ith treatment called the ith treatment effect. 
Equation 3-2 is usually called the effects model. 

Both the means model and the effects model are linear statistical models; that is, 
the response variable is a linear function of the model parameters. Although both 
forms of the model are useful, the effects model is more widely encountered in the 
experimental design literature. It has some intuitive appeal in that /u, is a constant and 
the treatment effects r, represent deviations from this constant when the specific treat¬ 
ments are applied. 

Equation 3-2 (or 3-1) is also called the one-way or single-factor analysis of vari¬ 
ance model because only one factor is investigated. Furthermore, we will require that 
the experiment be performed in random order so that the environment in which the 
treatments are applied (often called the experimental units) is as uniform as possible. 
Thus, the experimental design is a completely randomized design. Our objectives will 
be to test appropriate hypotheses about the treatment means and to estimate them. For 
hypothesis testing, the model errors are assumed to be normally and independently dis¬ 
tributed random variables with mean zero and variance a 2 . The variance cr 2 is assumed 
to be constant for all levels of the factor. This implies that the observations 

y,j ~ N(fi + T h a 2 ) 

and that the observations are mutually independent. 


-C: 


1 , 2 ,. 
1 , 2 ,. 


(3-2) 
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Fixed or Random Factor? 

The statistical model, Equation 3-2, describes two different situations with respect to the 
treatment effects. First, the a treatments could have been specifically chosen by the 
experimenter. In this situation we wish to test hypotheses about the treatment means, 
and our conclusions will apply only to the factor levels considered in the analysis. The 
conclusions cannot be extended to similar treatments that were not explicitly considered. 
We may also wish to estimate the model parameters (/a, t„ a 1 ). This is called the fixed 
effects model. Alternatively, the a treatments could be a random sample from a larger 
population of treatments. In this situation we should like to be able to extend the con¬ 
clusions (which are based on the sample of treatments) to all treatments in the population, 
whether they were explicitly considered in the analysis or not. Here the r, are random 
variables, and knowledge about the particular ones investigated is relatively useless. 
Instead, we test hypotheses about the variability of the r, and try to estimate this vari¬ 
ability. This is called the random effects model or components of variance model. We 
will defer discussion of experiments with random factors to Chapter 12. 


3-3 ANALYSIS OF THE FIXED EFFECTS MODEL 


In this section, we develop the single-factor analysis of variance for the fixed effects 
model. Recall that y L represents the total of the observations under the ith treatment. Let 
y,. represent the average of the observations under the ith treatment. Similarly, let y„ 
represent the grand total of all the observations andy„ represent the grand average of all 
the observations. Expressed symbolically, 


y,-. = E y>j’ y >. = yJ n * = L 2 a 

y.. = ii y-j % = %JN 


(3-3) 


where N = an is the total number of observations. We see that the “dot” subscript 
notation implies summation over the subscript that it replaces. 

We are interested in testing the equality of the a treatment means; that is, £(y,y) = 
p, + r, = Hi, i = 1, 2,..., a. The appropriate hypotheses are 


Ho'.fli = 1*2= ■ = Ha 

Hi : Hi ^ Hj for at least one pair (i, j) 

In the effects model, we break the ith treatment mean h, up into two components such 
that Hi = H + T i- We usually think of p. as an overall mean so that 


2 Mi 


a 


H 


This definition implies that 
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That is, the treatment or factor effects can be thought of as deviations from the overall 
mean . 2 Consequently, an equivalent way to write the above hypotheses is in terms of the 
treatment effects 77 , say 

Ho'.Ti = t 2 = ■ T a = 0 

Hi : 77 =£ 0 for at least one i 

Thus, we speak of testing the equality of treatment means or testing that the treatment 
effects (the 77 ) are zero. The appropriate procedure for testing the equality of a treatment 
means is the analysis of variance. 


3-3.1 Decomposition of the Total Sum of Squares 

The name analysis of variance is derived from a partitioning of total variability into its 
component parts. The total corrected sum of squares 

ssr = E E ( yu - y ..) 2 

is used as a measure of overall variability in the data. Intuitively, this is reasonable 
because, if we were to divide SS T by the appropriate number of degrees of freedom (in 
this case, an — 1 = N - 1), we would have the sample variance of the y’s. The sample 
variance is, of course, a standard measure of variability. 

Note that the total corrected sum of squares SS T may be written as 

E E (y<, - yd 2 = E E Ky, - y..) + (y u ~ yd? (3-4) 

or 

E E (y-j - y.f = n J) (y, - y..) 2 + i i (y.j - y,) 2 

+ 2 2 E (y, - y..)(y,y - yd (3-5) 

However, the cross-product term in Equation 3-5 is zero, because 

E (yu ~ yd = y,. - «y.-. = y«. - «(y,./«) = o 

Therefore, we have 

E E (yu - yd 2 = «E (y, - yd 2 + E E <y u - yd 2 (3-6) 

Equation 3-6 states that the total variability in the data, as measured by the total corrected 
sum of squares, can be partitioned into a sum of squares of the differences between the 
treatment averages and the grand average, plus a sum of squares of the differences of 
observations within treatments from the treatment average. Now, the difference between 




information on this subject, refer to the supplemental text material for Chapter 3. 
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the observed treatment averages and the grand average is a measure of the differences 
between treatment means, whereas the differences of observations within a treatment 
from the treatment average can be due only to random error. Thus, we may write Equation 
3-6 symbolically as 

SS, = SS Treatments + SS E 

where SS Treatments is called the sum of squares due to treatments (i.e., between treatments), 
and SS F is called the sum of squares due to error (i.e., within treatments). There are an = 
N total observations; thus, SS T has N — 1 degrees of freedom. There are a levels of the 
factor (and a treatment means), so SS Treat ments has a - 1 degrees of freedom. Finally, 
within any treatment there are n replicates providing n — 1 degrees of freedom with 
which to estimate the experimental error. Because there are a treatments, we have a(n - 
1 ) = an — a = N — a degrees of freedom for error. 

It is helpful to examine explicitly the two terms on the right-hand side of the fun¬ 
damental analysis of variance identity (Equation 3-6). Consider the error sum of squares 

SS E = J) 2 (y v - y,-.) 2 = 2 [| (y.j - y,) 2 J 

In this form it is easy to see that the term within square brackets, if divided by n - 1, is 
the sample variance in the ith treatment, or 

2 (y<i ~ y>f 

Sf = j= ' n — - i = 1, 2.a 

Now a sample variances may be combined to give a single estimate of the common 
population variance as follows: 

2 f 2 (y v ~ yJ 2 ] 

(n - 1)5? + (n - \)S\ + • • • + (ti - 1)S 2 = 1-1 _£ 

(n - 1) + (n - 1) + ••• + («- 1) X (« - 1) 

_ SS E 
~ (N-a) 

Thus, SS e /(N - a) is a pooled estimate of the common variance within each of the a 
treatments. 

Similarly, if there were no differences between the a treatment means, we could use 
the variation of the treatment averages from the grand average to estimate cr 2 . 
Specifically, 


is an estimate of a 2 if the treatment means are equal. The reason for this may be intui¬ 
tively seen as follows: The quantity 2f=i(y,. — y..) 2 /(a — 1) estimates a 2 /n, the variance 
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of the treatment averages, so raS“ = 1 (;y,. - yj 2 /(a - 1 ) must estimate a 2 if there are no 
differences in treatment means. 

We see that the analysis of variance identity (Equation 3-6) provides us with two 
estimates of a 2 —one based on the i nh erent variability within treatments and one based 
on the variability between treatments. If there are no differences in the treatment means, 
these two estimates should be very similar, and if they are not, we suspect that the 
observed difference must be caused by differences in the treatment means. Although we 
have used an intuitive argument to develop this result, a somewhat more formal approach 
can be taken. 

The quantities 


MS Trea 


‘^‘^Treatmei 

a - 1 


and 


MS e 


SS e 
N - a 


are called mean squares. We now examine the expected values of these mean squares. 
Consider 

= i ( yfj - 2y i jy i . + 5ft] 

Substituting the model (Equation 3-1) into this equation, we obtain 

E(MS f ) = -^— a e[| £ (ll + T, + €,/ - - n i AC + T, + €,j) ] 

Now when squaring and taking expectation of the quantity within the brackets, we see 
that terms involving e? and e 2 are replaced by cr 2 and ncr 2 , respectively, because 
E(e,y) = 0. Furthermore, all cross-products involving e, y have zero expectation. Therefore, 
after squaring and taking expectation, the last equation becomes 

E(MS e ) = — \ a j N/J, 2 + n 5) t? + No 2 — NfJb 2 - n ^ t 2 - aer 2 ] 


or 


E(MSe) = cr 2 
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By a similar approach, we may also show that 3 

n 2 7? 

£(MS Treannents ) = o- 2 + - 

Thus, as we argued heuristically, MS E = SS /: /(N - a) estimates a 2 , and, if there are no 
differences in treatment means (which implies that r, = 0), MS Treatments = SS TrcMmcnt J 
(a - 1) also estimates a 2 . However, note that if treatment means do differ, the expected 
value of the treatment mean square is greater than a 2 . 

It seems clear that a test of the hypothesis of no difference in treatment means can 
be performed by comparing MS Treatmen ts and MS E . We now consider how this comparison 
may be made. 


3-3.2 Statistical Analysis 

We now investigate how a formal test of the hypothesis of no differences in treatment 
means (H 0 :fjt, = p , 2 = ■ • • = fi a , or equivalently, H 0 : Ti = r 2 = • • • T a = 0) can be 
performed. Because we have assumed that the errors e,y are normally and independently 
distributed with mean zero and variance a 2 , the observations y tJ are normally and inde¬ 
pendently distributed with mean /r. + t, and variance a 2 . Thus, SS T is a sum of squares 
in normally distributed random variables; consequently, it can be shown that SS T /a- 2 is 
distributed as chi-square with N - 1 degrees of freedom. Furthermore, we can show that 
SS e /(t 2 is chi-square with N — a degrees of freedom and that 5S Tre atmei«s/<T 2 is chi-square 
with a — 1 degrees of freedom if the null hypothesis H 0 :t { = 0 is true. However, all 
three sums of squares are not necessarily independent because SS Treatments and SS E add to 
SS T . The following theorem, which is a special form of one attributed to William Coch¬ 
ran, is useful in establishing the independence of SS E and SS Trea t m ents- 

THEOREM 3-1 . 

Cochran’s Theorem 

Let Z, be NID(0, 1) for i = 1, 2,.... v and 

2 Z? = Qx + Q 2 + ■ ■ ■ + Qs 

where s ^ v, and Q, has v, degrees of freedom (i = 1, 2,..., s). Then Q x , Q 2 ,..., Q„ 
are independent chi-square random variables with v u v 2 ,..., v s degrees of freedom, 
respectively, if and only if 

v = v t + v 2 + ■■■ + v s 


Because the degrees of freedom for SS Treatments and SS E add to N - 1, the total number 
of degrees of freedom, Cochran’s theorem implies that 55 Tre atments/<r 2 and SS E /a 2 are 


5 Refer to the supplemental text material for Chapter 3. 
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Table 3-3 The Analysis of Variance Table for the Single-Factor, Fixed Effects Model 


Source of Variation 

Sum of 

Squares 

Degrees of 
Freedom 

Square 

F 0 






Between treatments 

= n i (y, - y.f 

a - 1 

A/S Treatments 

p ^*^Treatments 

0 ms e 

Error (within 
treatments) 

SS E = SSt - SSt . . 

N-a 

ms e 


Total 

SSt = X X (y,y - y..) 2 

N - 1 




independently distributed chi-square random variables. Therefore, if the null hypothesis 
of no difference in treatment means is true, the ratio 

P — ^Treatments Kfl ~ 1) _ M ^Treatments 

0 SS e /(N -a) MS e { j 

is distributed as F with a - 1 and N - a degrees of freedom. Equation 3-7 is the test 
statistic for the hypothesis of no differences in treatment means. 

From the expected mean squares we see that, in general, MS E is an unbiased estimator 
of or 2 . Also, under the null hypothesis, MS Treatments is an unbiased estimator of a 2 . How¬ 
ever, if the null hypothesis is false, the expected value of MS Treatments is greater than a 2 . 
Therefore, under the alternative hypothesis, the expected value of the numerator of the 
test statistic (Equation 3-7) is greater than the expected value of the denominator, and 
we should reject H 0 on values of the test statistic that are too large. This implies an upper- 
tail, one-tail critical region. Therefore, we should reject H 0 and conclude that there are 
differences in the treatment means if 

F’o > Fa,a-l,N-a 

where F 0 is computed from Equation 3-7. Alternatively, we could use the P-value ap¬ 
proach for decision making. 

Computing formulas for the sums of squares may be obtained by rewriting and 
simplifying the definitions of MS Treatments and SS T in Equation 3-6. This yields 

SSt = i i yfj - | (3-8) 

;=l j= i TV 

and 

^Treatments = \ i yf. - (3-9) 

The error sum of squares is obtained by subtraction as 

SS E = SS T ~ SS Treatments (3-10) 

The test procedure is summarized in Table 3-3 above. This is called an analysis of 
variance table. 

EXAMPLE 3-1 . 

The Tensile Strength Experiment 

To illustrate the analysis of variance, return to the example first discussed in Section 3-1. 
Recall that the development engineer is interested in determining if the cotton weight 
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percentage in a synthetic fiber affects the tensile strength, and she has run a completely 
randomized experiment with five levels of cotton weight percentage and five replicates. 
For convenience, we repeat the data from Table 3-1 here: 


Weight 
Percentage 
of Cotton 


Observed Tensile Strength 
(lb/in 2 ) 



Averages 

y,. 

1 

2 

3 

4 

5 

y,. 

15 

7 

7 

15 

11 

9 

49 

9.8 

20 

12 

17 

12 

18 

18 

77 

15.4 

25 

14 

18 

18 

19 

19 

88 

17.6 

30 

19 

25 

22 

19 

23 

108 

21.6 

35 

7 

10 

11 

15 

11 

54 

y_ = 376 

10.8 

y„ = 15.04 


We will use the analysis of variance to test H 0 : = fi 2 = = /x 4 = fi 5 against 

the alternative : some means are different. The sums of squares required are computed 
as follows: 

5 5 2 

SS T =2J i yfj- y - 

= (7) 2 + (7) 2 + (15) 2 + • ■ ■ + (15) 2 + (ll) 2 - = 636.96 

ss Treatments = - X yf. ~ ^ 

= \ [(49) 2 + • • • + (54) 2 ] - = 475.76 

SSe = SSj- >5Sxreatments 

= 636.96 - 475.76 = 161.20 

Usually, these calculations would be performed on a computer, using a software package 
with the capability to analyze data from designed experiments. 

The analysis of variance is summarized in Table 3-4. Note that the between-treatment 
mean square (118.94) is many times larger than the within-treatment or error mean square 
(8.06). This indicates that it is unlikely that the treatment means are equal. More formally, 
we can compute the F ratio F 0 = 118.94/8.06 = 14.76 and compare this to an appropriate 
upper-tail percentage point of the F 4 2 o distribution. Suppose that the experimenter has 
selected a = 0.05. From Appendix Table IV we find that Fo. 05 , 4,20 = 2.87. Because 
F 0 = 14.76 > 2.87, we reject H 0 and conclude that the treatment means differ; that is, 


Table 3-4 Analysis of Varianc e for the Tensile Strength Data _ 

Sum of Degrees of Mean 

Source of Variation Squares_ Freedom _ Square _ Fo _ P-Value 

Cotton weight percentage 475.76 4 118.94 F 0 = 14.76 <0.01 

Error 161.20 20 8.06 

Total 636.96 24 
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the cotton weight percentage in the fiber significantly affects the mean tensile strength. 
We could also compute a P-value for this test statistic. Figure 3-3 shows the reference 
distribution (F 4>2 o) for the test statistic F 0 . Clearly, the P-value is very small in this case. 
Because F 0 .oi,4.2o = 4.43 and F 0 > 4.43, we can conclude that an upper bound for the 
P-value is 0.01; that is, P < 0.01 (the exact P-value is P = 9.11 X 10 -6 ). 


Manual Computations 

The reader has likely noted that we defined sum of squares in terms of averages; that is, 
from Equation 3-6, 


SSj rea 


■»2 <*. - L > 2 


but we developed the computing formulas using totals. For example, to compute 
SSVreatmenis, we would use Equation 3-9: 


= 1 i^- 


£ 

N 


The primary reason for this is convenience; furthermore, the totals y L and y__ are not as 
subject to rounding error as are the averages y L and y_. 

Generally, we need not be too concerned with computing, because there are many 
widely available computer programs for performing the calculations. These computer 
programs are also helpful in performing many other analyses associated with experi¬ 
mental design (such as residual analysis and model adequacy checking). In many cases, 
these programs will also assist the experimenter in setting up the design. 

When hand calculations are necessary, it is sometimes helpful to code the observa¬ 
tions. This is illustrated in the next example. 


EXAMPLE 3-2 . 

Coding the Observations 

The calculations in the analysis of variance may often be made more accurate or sim¬ 
plified by coding the observations. For example, consider the tensile strength data in 
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Table 3-5 Coded Tensile Strength Data for Example 3-2 


Cotton 

Weight 

Percentage 



Observations 



Totals 

y, 

1 

2 

3 

4 

5 

15 

-8 

-8 

0 

-4 

-6 

-26 

20 

-3 

2 

-3 

3 

3 

2 

25 

-1 

3 

3 

4 

4 

13 

30 

4 

10 

7 

4 

8 

33 

35 

-8 

-5 

-4 

0 

-4 

-21 


Example 3-1. Suppose we subtract 15 from each observation. The coded data are shown 
in Table 3-5. It is easy to verify that 


SS T = (—8) 2 + (—8) 2 + • • • + (—4) 2 


SS Tre , 


(—26) 2 + (2) 2 + • ■ • + (-21) 2 
5 


(if 

25 

Of 

25 


= 636.96 
= 475.76 


and 

SS E = 161.20 

Comparing these sums of squares to those obtained in Example 3-1, we see that sub¬ 
tracting a constant from the original data does not change the sums of squares. 

Now suppose that we multiply each observation in Example 3-1 by 2. It is easy to 
verify that the sums of squares for the transformed data are SS T = 2547.84, SS Trea tments 
= 1903.04, and SS E = 644.80. These sums of squares appear to differ considerably from 
those obtained in Example 3-1. However, if they are divided by 4 (i.e., 2 2 ), the results 
are identical. For example, for the treatment sum of squares 1903.04/4 = 475.76. Also, 
for the coded data, the F ratio is F = (1903.04/4)/(644.80/20) = 14.76, which is identical 
to the F ratio for the original data. Thus, the analyses of variance are equivalent. 


Randomization Tests and Analysis of Variance 

In our development of the F test analysis of variance, we have used the assumption that 
the random errors e, ; are normally and independently distributed random variables. The 
F test can also be justified as an approximation to a randomization test. To illustrate, 
suppose that we have five observations on each of two treatments and that we wish to 
test the equality of treatment means. The data would look like this: 

Treatment 1 Treatment 2 

Yu yu 

3*12 3*22 

Yi3 Y23 

yi4 y 24 

yis 3*25 

We could use the F test analysis of variance to test // 0 :/A| = jx 2 . Alternatively, we could 
use a somewhat different approach. Suppose we consider all the possible ways of alio- 
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eating the 10 numbers in the sample above to the two treatments. There are 10!/5!5! = 
252 possible arrangements of the 10 observations. If there is no difference in treatment 
means, all 252 arrangements are equally likely. For each of the 252 arrangements, we 
calculate the value of the F statistic using Equation 3-7. The distribution of these F 
values is called a randomization distribution, and a large value of F indicates that the 
data are not consistent with the hypothesis // 0 : /a, = /a 2 . For example, if the value of 
F actually observed was exceeded by only 5 of the values of the randomization distribu¬ 
tion, this would correspond to rejection of // 0 : yu., = ^ at a significance level of a = 
5/252 = .0198 (or 1.98 percent). Notice that no normality assumption is required in this 
approach. 

The difficulty with this approach is that, even for relatively small problems, it 
is computationally prohibitive to enumerate the exact randomization distribution. How¬ 
ever, numerous studies have shown that the exact randomization distribution is well- 
approximated by the usual normal-theory F distribution. Thus, even without the 
normality assumption, the F test can be viewed as an approximation to the randomization 
test. For further reading on randomization tests in the analysis of variance, see Box, 
Hunter, and Hunter (1978). 


3-3.3 Estimation of the Model Parameters 

We now present estimators for the parameters in the single-factor model 

y*t = F + r, , + €„• 

and confidence intervals on the treatment means. We will prove later that reasonable 
estimates of the overall mean and the treatment effects are given by 

A = y~ (3-11) 

% = y L - y.., i = 1, 2,.... a 

These estimators have considerable intuitive appeal; note that the overall mean is esti¬ 
mated by the grand average of the observations and that any treatment effect is just the 
difference between the treatment average and the grand average. 

A confidence interval estimate of the ith treatment mean may be easily determined. 
The mean of the ith treatment is 


/Jii = (JL + T, 

A point estimator of ju,, would be /&, = (jl + % = y L . Now, if we assume that the errors 
are normally distributed, each y L is NID(/x.„ a 2 In). Thus, if cr 2 were known, we could 
use the normal distribution to define the confidence interval. Using the MS h as an esti¬ 
mator of cr 2 , we would base the confidence interval on the t distribution. Therefore, a 
100(1 — a) percent confidence interval on the ith treatment mean /c, is 



A 100(1 - a) percent confidence interval on the difference in any two treatments means, 
say fJL, - /ij , would be 


- % + t a . 


(3-13) 
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EXAMPLE 3-3 . 

Using the data in Example 3-1, we may find the estimates of the overall mean and the 
treatment effects as (L = 376/25 = 15.04 and 

fi = y u - y„ = 9.80 - 15.04 = -5.24 

t 2 = y 2 . - y.. = 15.40 - 15.04 = +0.36 

r 3 = y 3 . - y.f = 17.60 - 15.04 = -2.56 

t 4 = %. - y.. = 21.60 - 15.04 = +6.56 

t 5 = y 5 . - y.. = 10.80 - 15.04 = -4.24 


A 95 percent confidence interval on the mean of treatment 4 (30 percent cotton) is 
computed from Equation 3-12 as 


21.60 - 2.086 


or 


21.60 - 2.65 =S fi 4 =£ 21.60 + 2.65 
Thus, the desired 95 percent confidence interval is 18.95 =£ fi 4 =£ 24.25. 


Simultaneous Confidence Intervals 

The confidence interval expressions given in Equations 3-12 and 3-13 are one-at-a-time 
confidence intervals. That is, the confidence level 1 - a applies to only one particular 
estimate. However, in many problems, the experimenter may wish to calculate several 
confidence intervals, one for each of a number of means or differences between means. 
If there are r such 100(1 - a) percent confidence intervals of interest, the probability 
that the r intervals will simultaneously be correct is at least 1 - ra. The probability ra 
is often called the experimentwise error rate or overall confidence coefficient. The 
number of intervals r does not have to be large before the set of confidence intervals 
becomes relatively uninformative. For example, if there are r = 5 intervals and a = 0.05 
(a typical choice), the simultaneous confidence level for the set of five confidence inter¬ 
vals is at least 0.75, and if r = 10 and a = 0.05 the simultaneous confidence level is at 
least 0.50. 

One approach to ensuring that the simultaneous confidence level is not too small is 
to replace a/2 in the one-at-a-time confidence interval Equations 3-12 and 3-13 with 
a/(2r). This is called the Bonferroni method, and it allows the experimenter to construct 
a set of r simultaneous confidence intervals on treatment means or differences in treat¬ 
ment means for which the overall confidence level is at least 100(1 - a) percent. When 
r is not too large, this is a very nice method that leads to reasonably short confidence 
intervals. For more information refer to the supplemental text material for Chapter 3. 


3-3.4 Unbalanced Data 

In some single-factor experiments the number of observations taken within each treat¬ 
ment may be different. We then say that the design is unbalanced. The analysis of 
variance described above may still be used, but slight modifications must be made in the 
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sum of squares formulas. Let n, observations be taken under treatment i (i — 1,2,, 
a) and N = 2? =1 The manual computational formulas for SS, and SS Tre atments become 

SSt = i i yl - | (3-14) 

and 


y yl_£ 

h n, N 


(3-15) 


No other changes are required in the analysis of variance. 

There are two advantages in choosing a balanced design. First, the test statistic is 
relatively insensitive to small departures from the assumption of equal variances for the 
a treatments if the sample sizes are equal. This is not the case for unequal sample sizes. 
Second, the power of the test is maximized if the samples are of equal size. 


3-4 MODEL ADEQUACY CHECKING 

The decomposition of the variability in the observations through an analysis of variance 
identity (Equation 3-6) is a purely algebraic relationship. However, the use of the par¬ 
titioning to test formally for no differences in treatment means requires that certain as¬ 
sumptions be satisfied. Specifically, these assumptions are that the observations are 
adequately described by the model 

ytj = ii + T t + €,y 

and that the errors are normally and independently distributed with mean zero and con¬ 
stant but unknown variance cr 2 . If these assumptions are valid, the analysis of variance 
procedure is an exact test of the hypothesis of no difference in treatment means. 

In practice, however, these assumptions will usually not hold exactly. Consequently, 
it is usually unwise to rely on the analysis of variance until the validity of these as¬ 
sumptions has been checked. Violations of the basic assumptions and model adequacy 
can be easily investigated by the examination of residuals. We define the residual for 
observation j in treatment i as 

e ij = yij ~ Sij (3-16) 

where y tj is an estimate of the corresponding observation y,j obtained as follows: 

Su = 4 + f,- 

= y.. + in - y.) 

= Si. (3-17) 

Equation 3-17 gives the intuitively appealing result that the estimate of any observation 
in the ith treatment is just the corresponding treatment average. 

Examination of the residuals should be an automatic part of any analysis of variance. 
If the model is adequate, the residuals should be structureless; that is, they should 
contain no obvious patterns. Through a study of residuals, many types of model inade¬ 
quacies and violations of the underlying assumptions can be discovered. In this section, 
we show how model diagnostic checking can be done easily by graphical analysis of 
residuals and how to deal with several commonly occurring abnormalities. 
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3-4.1 The Normality Assumption 

A check of the normality assumption could be made by plotting a histogram of the 
residuals. If the NID(0, a 2 ) assumption on the errors is satisfied, this plot should look 
like a sample from a normal distribution centered at zero. Unfortunately, with small 
samples, considerable fluctuation often occurs, so the appearance of a moderate departure 
from normality does not necessarily imply a serious violation of the assumptions. Gross 
deviations from normality are potentially serious and require further analysis. 

An extremely useful procedure is to construct a normal probability plot of the resid¬ 
uals. Recall that in Chapter 2 we used a normal probability plot of the raw data to check 
the assumption of normality when using the t-test. In the analysis of variance, it is usually 
more effective (and straightforward) to do this with the residuals. If the underlying error 
distribution is normal, this plot will resemble a straight line. In visualizing the straight 
line, place more emphasis on the central values of the plot than on the extremes. 

Table 3-6 shows the original data and the residuals for the tensile strength data in 
Example 3-1. The normal probability plot is shown in Figure 3-4 on the next page. The 
general impression from examining this display is that the error distribution may be 
slightly skewed, with the right tail being longer than the left. The tendency of the normal 
probability plot to bend down slightly on the left side implies that the left tail of the error 
distribution is somewhat thinner than would be anticipated in a normal distribution; that 
is, the negative residuals are not quite as large (in absolute value) as expected. This plot 
is not grossly nonnormal, however. 

In general, moderate departures from normality are of little concern in the fixed 
effects analysis of variance (recall our discussion of randomization tests in Section 
3-3.2). An error distribution that has considerably thicker or thinner tails than the normal 
is of more concern than a skewed distribution. Because the F test is only slightly affected, 
we say that the analysis of variance (and related procedures such as multiple comparisons) 
is robust to the normality assumption. Departures from normality usually cause both the 
true significance level and the power to differ slightly from the advertised values, with 


Table 3-6 Data and Residuals from Example 3-1" 


Cotton 

1 

2 

3 

4 

5 

h = % 


|-2.8 

|-2.8 

| 5.2 

1 1.2 

|-0.8 


15 

7 (15) 

7 (19) 

15 (25) 

11 (12) 

9 (6) 

9.8 


|-3.4 

1 1.6 

|-3.4 

1 2.6 

1 2.6 


20 

12 (8) 

17 (14) 

12 (1) 

18 (11) 

19 (3) 

15.4 


1 - 3-6 

1 0.4 

| 0.4 

| 1.4 

| 1.4 


25 

14 (18) 

18 (13) 

18 (20) 

19 (7) 

19 (9) 

17.6 


[-2.6 

1 3.4 

| 0.4 

|-2.6 

1 1.4 


30 

19 (22) 

25 (5) 

22 (2) 

19 (24) 

23 (10) 

21.6 


|-3.8 

|—0.8 

| 0.2 

1 4.2 

| 0.2 


35 

7 (17) 

10 (21) 

11 (4) 

15 (16) 

11 (23) 

10.8 


" The residuals are shown in the box in each cell. The numbers in parentheses indicate the order 
collection. 
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-3.8 -1.55 0.7 2.95 5.2 

Residual 

Figure 3-4 Normal probability plot of residuals for Example 3-1. 


the power generally being lower. The random effects model that we will discuss in 
Chapter 12 is more severely affected by nonnormality. 

A very common defect that often shows up on normal probability plots is one residual 
that is very much larger than any of the others. Such a residual is often called an outlier. 
The presence of one or more outliers can seriously distort the analysis of variance, so 
when a potential outlier is located, careful investigation is called for. Frequently, the 
cause of the outlier is a mistake in calculations or a data coding or copying error. If this 
is not the cause, the experimental circumstances surrounding this run must be carefully 
studied. If the outlying response is a particularly desirable value (high strength, low cost, 
etc.), the outlier may be more informative than the rest of the data. We should be careful 
not to reject or discard an outlying observation unless we have reasonably nonstatistical 
grounds for doing so. At worst, you may end up with two analyses; one with the outlier 
and one without. 

There are several formal statistical procedures for detecting outliers [e.g., see Barnett 
and Lewis (1994), John and Prescott (1975), and Stefansky (1972)]. A rough check for 
outliers may be made by examining the standardized residuals 

e ij 

dii = Vm E 


(3-18) 
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If the errors e„ 7 are N( 0, a 2 ), the standardized residuals should be approximately normal 
with mean zero and unit variance. Thus, about 68 percent of the standardized residuals 
should fall within the limits ±1, about 95 percent of them should fall within ±2, and 
virtually all of them should fall within ±3. A residual bigger than 3 or 4 standard de¬ 
viations from zero is a potential outlier. 

For the tensile strength data of Example 3-1, the normal probability plot gives no 
indication of outliers. Furthermore, the largest standardized residual is 
_ _ _ 52 _ _ 52 _ 

13 " VMS, ~ V^06 2.84 

which should cause no concern. 


3-4.2 Plot of Residuals in Time Sequence 

Plotting the residuals in time order of data collection is helpful in detecting correlation 
between the residuals. A tendency to have runs of positive and negative residuals indi¬ 
cates positive correlation. This would imply that the independence assumption on the 
errors has been violated. This is a potentially serious problem and one that is difficult to 
correct, so it is important to prevent the problem if possible when the data are collected. 
Proper randomization of the experiment is an important step in obtaining independence. 

Sometimes the skill of the experimenter (or the subjects) may change as the exper¬ 
iment progresses, or the process being studied may “drift” or become more erratic. This 
will often result in a change in the error variance over time. This condition often leads 
to a plot of residuals versus time that exhibits more spread at one end than at the other. 
Nonconstant variance is a potentially serious problem. We will have more to say on the 
subject in Sections 3-4.3 and 3-4.4. 

Table 3-6 displays the residuals and the time sequence of data collection for the 
tensile strength data. A plot of these residuals versus time is shown in Figure 3-5. There 
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Figure 3-5 Plot of residuals versus time. 
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is no reason to suspect any violation of the independence or constant variance 
assumptions. 


3-4.3 Plot of Residuals Versus Fitted Values 

If the model is correct and if the assumptions are satisfied, the residuals should be struc¬ 
tureless; in particular, they should be unrelated to any other variable including the pre¬ 
dicted response. A simple check is to plot the residuals versus the fitted values y,y. (For 
the single-factor experiment model, remember that y tj = y,., the fth treatment average.) 
This plot should not reveal any obvious pattern. Figure 3-6 plots the residuals versus the 
fitted values for the tensile strength data of Example 3-1. No unusual structure is apparent. 

A defect that occasionally shows up on this plot is nonconstant variance. Sometimes 
the variance of the observations increases as the magnitude of the observation increases. 
This would be the case if the error or background noise in the experiment was a constant 
percentage of the size of the observation. (This commonly happens with many measuring 
instruments—error is a percentage of the scale reading.) If this were the case, the resid¬ 
uals would get larger as y tj gets larger, and the plot of residuals versus y tj would look 
like an outward-opening funnel or megaphone. Nonconstant variance also arises in cases 
where the data follow a nonnormal, skewed distribution because in skewed distributions 
the variance tends to be a function of the mean. 

If the assumption of homogeneity of variances is violated, the F test is only slightly 
affected in the balanced (equal sample sizes in all treatments) fixed effects model. How¬ 
ever, in unbalanced designs or in cases where one variance is very much larger than the 
others, the problem is more serious. Specifically, if the factor levels having the larger 
variances also have the smaller sample sizes, the actual type I error rate is larger than 
anticipated (or confidence intervals have lower actual confidence levels than were spec¬ 
ified). Conversely, if the factor levels with larger variances also have the larger sample 
sizes, the significance levels are smaller than anticipated (confidence levels are higher). 
This is a good reason for choosing equal sample sizes whenever possible. For random 
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Figure 3-6 Plot of residuals versus fitted values. 
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effects models, unequal error variances can significantly disturb inferences on variance 
components even if balanced designs are used. 

The usual approach to dealing with nonconstant variance when it occurs for the above 
reasons is to apply a variance-stabilizing transformation and then to run the analysis 
of variance on the transformed data. In this approach, one should note that the conclusions 
of the analysis of variance apply to the transformed populations. 

Considerable research has been devoted to the selection of an appropriate transfor¬ 
mation. If experimenters know the theoretical distribution of the observations, they may 
utilize this information in choosing a transformation. For example, if the observations 
follow the Poi sson distribution, the square root transformation y* = \fy n or 
y* = Vl + yij would be used. If the data follow the lognormal distribution, the loga¬ 
rithmic transformation y* = log y tj is appropriate. For binomial data expressed as frac¬ 
tions, the arcsin transformation y* = arcsinVy^ is useful. When there is no obvious 
transformation, the experimenter usually empirically seeks a transformation that equal¬ 
izes the variance regardless of the value of the mean. We offer some guidance on this at 
the conclusion of this section. In factorial experiments, which we introduce in Chapter 
5, another approach is to select a transformation that minimizes the interaction mean 
square, resulting in an experiment that is easier to interpret. In Chapter 14, we discuss 
in more detail methods for analytically selecting the form of the transformation. Trans¬ 
formations made for inequality of variance also affect the form of the error distribution. 
In most cases, the transformation brings the error distribution closer to normal. For more 
discussion of transformations, refer to Bartlett (1947), Box and Cox (1964), Dolby 
(1963), and Draper and Hunter (1969). 

Statistical Tests for Equality of Variance 

Although residual plots are frequently used to diagnose inequality of variance, several 
statistical tests have also also been proposed. These tests may be viewed as formal tests 
of the hypotheses 

H 0 :a\ = oi = • • ■ = of 

Hi: above not true for at least one of 

A widely used procedure is Bartlett’s test. The procedure involves computing a 
statistic whose sampling distribution is closely approximated by the chi-square distri¬ 
bution with o—l degrees of freedom when the a random samples are from independent 
normal populations. The test statistic is 

Xo = 2.3026 ^ (3-19) 


where 

q = (N - o)log 10 Sp - E («, - l)log, 0 Sf 

t (», - 1 )S? 

and Sj is the sample variance of the zth population. 
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The quantity q is large when the sample variances Sf differ greatly and is equal to 
zero when all Sf are equal. Therefore, we should reject H 0 on values of \o that are too 
large; that is, we reject H 0 only when 

Xo > Xl.a -1 

where xi.a-i is the upper a percentage point of the chi-square distribution with a - 1 
degrees of freedom. The P-value approach to decision making could also be used. 

Bartlett’s test is very sensitive to the normality assumption. Consequently, when the 
validity of this assumption is doubtful, Bartlett’s test should not be used. 

EXAMPLE 3-4 . 

Because the normality assumption is not in question, we can apply Bartlett’s test to the 
tensile strength data from the cotton weight percentage experiment in Example 3-1. We 
first compute the sample variances in each treatment and find that Sf = 11.2, Sf = 9.8, 
Sj = 4.3, Si = 6.8, and Sf = 8.2. Then 

c2 _ 4(11,2) + 4(9.8) + 4(4.3) + 4(6.8) + 4(8.2) _ 

p 2Q 8.06 

q = 20 log ]O (8.06) - 4[logi 0 11.2 + log 10 9.8 + log 10 4.3 + log 10 6.8 + log 10 8.2] 


= 0.45 



and the test statistic is 


*° = 2 ' 3026 TT5T 093 


Because * 0 . 05,4 = 9.49, we cannot reject the null hypothesis and conclude that all five 
variances are the same. This is the same conclusion reached by analyzing the plot of 
residuals versus fitted values. 


Because Bartlett’s test is sensitive to the normality assumption, there may be situa¬ 
tions where an alternative procedure would be useful. Anderson and McLean (1974) 
present a nice discussion of statistical tests for equality of variance. The modified Levene 
test [see Levene (1960) and Conover, Johnson, and Johnson (1981)] is a very nice pro¬ 
cedure that is robust to departures from normality. To test the hypothesis of equal vari¬ 
ances in all treatments, the modified Levene test uses the absolute deviation of the 
observations y tJ in each treatment from the treatment median, say y, . Denote these de¬ 
viations by 



The modified Levene test then evaluates whether or not the mean of these deviations are 
equal for all treatments. It turns out that if the mean deviations are equal, the variances 
of the observations in all treatments will be the same. The test statistic for Levene’s test 
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Table 3-7 Peak Discharge Data 


Estimation 

Method 



Observations 



%. 

Si 

Si 

1 

0.34 

0.12 

1.23 

0.70 

1.75 

0.12 

0.71 

0.520 

0.66 

2 

0.91 

2.94 

2.14 

2.36 

2.86 

4.55 

2.63 

2.610 

1.09 

3 

6.31 

8.37 

9.75 

6.09 

9.82 

7.24 

7.93 

7.805 

1.66 

4 

17.15 

11.82 

10.95 

17.20 

14.35 

16.82 

14.72 

15.59 

2.77 

Estimation 

Method 


Deviations d tJ 

for the Modified Levene Test 





1 

0.18 

0.40 

0.71 

0.18 

1.23 

0.40 




2 

1.70 

0.33 

0.47 

0.25 

0.25 

1.94 




3 

1.495 

0.565 

1.945 

1.715 

2.015 

0.565 




4 

1.56 

3.77 

4.64 

1.61 

1.24 

1.23 





is simply the usual ANOVA F statistic for testing equality of means applied to the 
absolute deviations. 

EXAMPLE 3-5 . 

A civil engineer is interested in determining whether four different methods of estimating 
flood flow frequency produce equivalent estimates of peak discharge when applied to the 
same watershed. Each procedure is used six times on the watershed, and the resulting 
discharge data (in cubic feet per second) are shown in the upper panel of Table 3-7. The 
analysis of variance for the data, summarized in Table 3-8, implies that there is a differ¬ 
ence in mean peak discharge estimates given by the four procedures. The plot of residuals 
versus fitted values, shown in Figure 3-7 (top of page 84), is disturbing because the 
outward-opening funnel shape indicates that the constant variance assumption is not 
satisfied. 

We will apply the modified Levene test to the peak discharge data. The upper panel 
of Table 3-7 contains the treatment medians y, and the lower panel contains the deviations 
dij around the medians. Levene’s test consists of conducting a standard analysis of vari¬ 
ance on the dij. The F test statistic that results from this is F 0 = 4.55, for which the P- 
value is P = 0.0137. Therefore, Levene’s test rejects the null hypothesis of equal 
variances, essentially confirming the diagnosis we made from visual examination of 
Figure 3-7. The peak discharge data is a good candidate for data transformation. 


Empirical Selection of a Transformation 

We observed above that, if experimenters knew the relationship between the variance of 
the observations and the mean, they could use this information to guide them in selecting 


Table 3-8 Analysis of Variance for Peak Discharge Data 


Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Square 

F 0 

P-Value 

Methods 

708.3471 

3 

236.1157 

76.07 

<0.001 

Error 

62.0811 

20 

3.1041 



Total 

770.4282 

23 
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Figure 3-7 Plot of residuals versus y,j for Example 3-5. 


the form of the transformation. We now elaborate on this point and show one method 
for empirically selecting the form of the required transformation from the data. 

Let E(y) = /x be the mean of y, and suppose that the standard deviation of y is 
proportional to a power of the mean of y such that 

Uy ^ 

We want to find a transformation on y that yields a constant variance. Suppose that the 
transformation is a power of the original data, say 

y* = / (3-20) 

Then it can be shown that 

«= m a+ “ -1 (3-21) 

Clearly, if we set A = 1 — a, the variance of the transformed data y* is constant. 

Several of the common transformations discussed previously are summarized in Ta¬ 
ble 3-9. Note that A = 0 implies the log transformation. These transformations are ar¬ 
ranged in order of increasing strength. By the strength of a transformation we mean the 
amount of curvature it induces. A mild transformation applied to data spanning a narrow 


Table 3-9 Variance-Stabilizing Transformations 


Relationship 
Between it, and fi 

a 

A = 1 - a 

Transformation 

Comment 

cr v oc constant 

0 

1 

No transformation 


CT y oc fi 112 

1/2 

1/2 

Square root 

Poisson (count) data 

(T y oc IX 

1 

0 

Log 


CTy OC /L ' 

3/2 

-1/2 

Reciprocal square root 


cr v oc fx 2 

2 

-1 

Reciprocal 
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-10123 

logiv 

Figure 3-8 Plot of log S, versus log y L for the peak 
discharge data from Example 3-5. 


range has little effect on the analysis, whereas a strong transformation applied over a 
large range may have dramatic results. Transformations often have little effect unless the 
ratio y max /y min is larger than 2 or 3. 

In many experimental design situations where there is replication, we can empirically 
estimate a from the data. Because in the ith treatment combination a y . oc 
where 6 is a constant of proportionality, we may take logs to obtain 

log cr y . = log 0 + a log fa (3-22) 

Therefore, a plot of log ar y . versus log would be a straight line with slope a. Because 
we don’t know <r y . and /a,, we may substitute reasonable estimates of them in Equation 
3-22 and use the slope of the resulting straight line fit as an estimate of a. Typically, we 
would use the standard deviation 5, and the average y L of the ith treatment (or, more 
generally, the ith treatment combination or set of experimental conditions) to estimate 
a-y. and fij. 

To investigate the possibility of using a variance-stabilizing transformation on the 
peak discharge data from Example 3-5, we plot log 5, versus log y L in Figure 3-8. The 
slope of a straight line passing through these four points is close to 1/2 and, from Table 
3-9, this implies that the square root transformation may be appropriate. The analysis of 
variance for the transformed data y* = Vy is presented in Table 3-10, and a plot of 


Table 3-10 Analysis of Variance for Transformed Peak Discharge Data, y* = Vy 
Source of Sum of Degrees of Mean 

Variation Squares Freedom Square F 0 P -Value 

Methods 32.6842 3 10.8947 76.99 <0.001 

Error 2.6884 19 0.1415 

Total 35.3726 22 
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Figure 3-9 Plot of residuals from transformed data versus 
y* for the peak discharge data in Example 3-5. 


residuals versus the predicted response is shown in Figure 3-9. This residual plot is much 
improved in comparison to Figure 3-7, so we conclude that the square root transformation 
has been helpful. Note that in Table 3-10 we have reduced the error degrees of freedom 
by 1 to account for the use of the data to estimate the transformation parameter a. 

In practice, many experimenters select the form of the transformation by simply 
trying several alternatives and observing the effect of each transformation on the plot of 
residuals versus the predicted response. The transformation that produced the most sat¬ 
isfactory residual plot is then selected. 

3-4.4 Plots of Residuals Versus Other Variables 

If data have been collected on any other variables that might possibly affect the response, 
the residuals should be plotted against these variables. For example, in the tensile strength 
experiment of Example 3-1, strength may be significantly affected by the thickness of 
the fiber, so the residuals should be plotted versus fiber thickness. If different testing 
machines were used to collect the data, the residuals should be plotted against machines. 
Patterns in such residual plots imply that the variable affects the response. This suggests 
that the variable should be either controlled more carefully in future experiments or 
included in the analysis. 


3-5 PRACTICAL INTERPRETATION OF RESULTS 

After conducting the experiment, performing the statistical analysis, and investigating 
the underlying assumptions, the experimenter is ready to draw practical conclusions about 
the problem he or she is studying. Often this is relatively easy, and certainly in the simple 
experiments we have considered so far, this might be done somewhat informally, perhaps 
by inspection of graphical displays such as the box plots and scatter diagram in Figures 
3-1 and 3-2. However, in some cases, more formal techniques need to be applied. We 
will present some of these techniques in this section. 
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3-5.1 A Regression Model 

The factors involved in an experiment can be either quantitative or qualitative. A quan¬ 
titative factor is one whose levels can be associated with points on a numerical scale, 
such as temperature, pressure, or time. Qualitative factors, on the other hand, are factors 
for which the levels cannot be arranged in order of magnitude. Operators, batches of raw 
material, and shifts are typical qualitative factors because there is no reason to rank them 
in any particular numerical order. 

Insofar as the initial design and analysis of the experiment are concerned, both types 
of factors are treated identically. The experimenter is interested in determining the dif¬ 
ferences, if any, between the levels of the factors. If the factor is qualitative, such as 
operators, it is meaningless to consider the response for a subsequent run at an inter¬ 
mediate level of the factor. However, with a quantitative factor such as time, the exper¬ 
imenter is usually interested in the entire range of values used, particularly the response 
from a subsequent run at an intermediate factor level. That is, if the levels 1.0, 2.0, and 
3.0 hours are used in the experiment, we may wish to predict the response at 2.5 hours. 
Thus, the experimenter is frequently interested in developing an interpolation equation 
for the response variable in the experiment. This equation is an empirical model of the 
process that has been studied. 

The general approach to fitting empirical models is called regression analysis, which 
is discussed extensively in Chapter 10. See also the supplemental text material for this 
chapter. This section briefly illustrates the technique using the tensile strength data of 
Example 3-1. 

Figure 3-10 presents a scatter diagram of tensile strength y versus the weight per¬ 
centage of cotton x in the fabric for the experiment in Example 3-1. The open circles on 
the graph are the mean tensile strengths at each value of cotton weight percentage x. 
From examining the scatter diagram, it is clear that the relationship between tensile 


251 


20 


°0 10 20 30 40 

Cotton weight percentage 

Figure 3-10 Scatter diagram for the tensile strength 
data of Example 3-1. 





88 


CHAPTER 3 EXPERIMENTS WITH A SINGLE FACTOR: THE ANALYSIS OF VARIANCE 


strength and weight percentage cotton is not linear. As a first approximation, we could 
try fitting a quadratic model to the data, say 

y = Po + Pi* + P 2 X 2 + e 

where j8 0 , /3,, and /3 2 are unknown parameters to be estimated and e is a random error 
term. The method often used to estimate the parameters in a model such as this is the 
method of least squares. This consists of choosing estimates of the jS’s such that the sum 
of the squares of the errors (the e’s) are minimized. The least squares fit in our example 


y = -39.9886 + 4.596* - 0.0886* 2 

(If you are unfamiliar with regression methods, see Chapter 10 and the supplemental text 
material for this chapter.) 

This quadratic model is shown in Figure 3-10. It does not appear to be very satis¬ 
factory because it drastically underestimates the responses at * = 30 percent cotton and 
overestimates the responses at * = 25 percent. Perhaps an improvement can be obtained 
by adding a cubic term in x. The resulting cubic model fit is 

3 = 62.6114 - 9.0114* + 0.4814* 2 - 0.0076* 3 

This cubic fit is also shown in Figure 3-10. The cubic model appears to be superior to 
the quadratic because it provides a better fit at * = 25 and * = 30 percent cotton. 

In general, we would like to fit the lowest-order polynomial that adequately describes 
the system or process. In this example, the cubic polynomial seems to fit better than the 
quadratic, so the extra complexity of the cubic model is justified. Selecting the order of 
the approximating polynomial is not always easy, however, and it is relatively easy to 
overfit, that is, to add high-order polynomial terms that do not really improve the fit but 
increase the complexity of the model and often damage its usefulness as a predictor or 
interpolation equation. 

In this example, the empirical model could be used to predict mean tensile strength 
at values of cotton weight percentage within the region of experimentation. In other cases, 
the empirical model could be used for process optimization, that is, finding the levels 
of the design variables that result in the best values of the response. We will discuss and 
illustrate these problems extensively later in the book. 


3-5.2 Comparisons Among Treatment Means 

Suppose that in conducting an analysis of variance for the fixed effects model the null 
hypothesis is rejected. Thus, there are differences between the treatment means, but 
exactly which means differ is not specified. Sometimes in this situation, further compar¬ 
isons and analysis among groups of treatment means may be useful. The z'th treatment 
mean is defined as /z, = n + t„ and /x, is estimated by y L . Comparisons between treatment 
means are made in terms of either the treatment totals {y,.} or the treatment averages 
{y,-.}. The procedures for making these comparisons are usually called multiple com¬ 
parison methods. In the next several sections, we discuss methods for making compar¬ 
isons among individual treatment means or groups of these means. 
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3-5.3 Graphical Comparisons of Means 


It is very easy to develop a graphical procedure for the comparison of means following 
an analysis of variance. Suppose that the factor of interest has a levels and that 
y u , y 2 ,... ,y a . are the treatment averages. If we know a, any treatment average would 
have a standard deviation tr/Vn. Consequently, if all factor level means are identical, 
the observed sample means y L would behave as if they were a set of observations drawn 
at random from a normal distribution with mean y_ and standard deviation cr/Vn. Vi¬ 
sualize a normal distribution capable of being slid along an axis below which the 
y u y 2 .,... ,y a . are plotted. If the treatment means are all equal, there should be some 
position for this distribution that makes it obvious that the y L values were drawn from 
the same distribution. If this is not the case, the y im values that appear not to have been 
drawn from this distribution are associated with factor levels that produce different mean 
responses. 

The only flaw in this logic is that cr is unknown. However, we can replace a with 
VM5£ from the analysis of variance and use a t distribution with a scale factor 
VMS E /n instead of the normal. Such an arrangement for the tensile strength data of 
Example 3-1 is shown in Figure 3-11. 

To sketch the t distribution in Figure 3-11, simply multiply the abscissa t value by 
the scale factor 


and plot this against the ordinate of t at that point. Because the t distribution looks much 
like the normal, except that it is a little flatter near the center and has longer tails, this 
sketch is usually easily constructed by eye. If you wish to be more precise, there is a 
table of abscissa t values and the corresponding ordinates in Box, Hunter, and Hunter 
(1978). The distribution can have an arbitrary origin, although it is usually best to choose 
one in the region of the y,. values to be compared. In Figure 3-11, the origin is 15 lb/in 2 . 

Now visualize sliding the t distribution in Figure 3-11 along the horizontal axis, and 
examine the five means plotted in the figure. Notice that there is no location for the 
distribution such that all five averages could be thought of as typical, randomly selected 
observations from the distribution. This implies that all five means are not equal; thus, 
the figure is a graphical display of the results of the analysis of variance. The figure does 
indicate that 30 percent cotton produces much higher tensile strengths than would 20 or 
25 percent cotton (which are about the same) or that either 15 or 35 percent cotton (which 
are about the same) would result in even lower tensile strengths. 


I_ 15 j 35 _ ,20 25 , 30 _I 

5 10 15 20 25 

Average tensile strength 
(lb/in 2 ) 

Figure 3-11 Tensile strength averages from the cotton weight 
percenta ge expe rime nt in re lation to a t distribution with a scale 
factor VMS £ /n = V8.06/5 = 1.27. 
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This simple procedure is a rough but effective technique for many multiple com¬ 
parison problems. However, there are more formal methods. We now give a brief dis¬ 
cussion of some of these procedures. 


3-5.4 Contrasts 

Many multiple comparison methods use the idea of a contrast. Consider the synthetic 
fiber testing problem of Example 3-1. Because the null hypothesis was rejected, we know 
that some cotton weight percentages produce different tensile strengths than others, but 
which ones actually cause this difference? We might suspect at the outset of the exper¬ 
iment that levels 4 and 5 cotton weight percentages (30 and 35 percent) produce the 
same tensile strength, implying that we would like to test the hypothesis 

= fi 5 

H\:/i 4 * Vs 


or equivalently, 

Ho-fi4~ Us = 0 (3-23) 

Hi'.fi* - fi 5 # 0 

If we had suspected at the start of the experiment that the average of the lowest levels 
of cotton weight percentages (1 and 2) did not differ from the average of the highest 
levels of cotton weight percentages (4 and 5), then the hypothesis would have been 

Ho’.fii + Pa = 1*4 + Ms 
+ 1^2 * ia 4 + /i 5 


or 


Ho'./ii + fi2 - ix 4 - fi 5 = 0 (3-24) 

Hi:Hi + fJh ~ - Us = 0 

In general, a contrast is a linear combination of parameters of the form 

F = J) Ct/xt 

where the contrast constant c u c 2 ,...,c a sum to zero; that is, 2“ =1 c t = 0. Both of the 
hypotheses above can be expressed in terms of contrasts: 


Hoi'S, cm = 0 
HiiS C <7A * 0 


(3-25) 


The contrast constants for the hypotheses in Equation 3-23 are Ci = c 2 = c 3 = 0, c 4 = +1, 
and c 5 = —1, whereas for the hypotheses in Equation 3-24 they are c, = c 2 = +1, 
c 3 = 0, and c 4 = c 5 = — 1. 
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Testing hypotheses involving contrasts can be done in two basic ways. The first 
method uses a t-test. Write the contrast of interest in terms of the treatment totals, giving 

c = 2 CM. 

The variance of C is 


V(C) = na 2 X d (3-26) 

when the sample sizes in each treatment are equal. If the null hypothesis in Equation 
3-25 is true, the ratio 


2 c,y L 



has the N( 0, 1) distribution. Now we would replace the unknown variance cr 2 by its 
estimate, the mean square error MS E and use the statistic 


to 



(3-27) 


to test the hypotheses in Equation 3-25. The null hypothesis would be rejected if 1 1 0 | in 
Equation 3-27 exceeds t a/2 ,N- a - 

The second approach uses an F test. Now the square of a t random variable with v 
degrees of freedom is an F random variable with 1 numerator and v denominator degrees 
of freedom. Therefore, we can obtain 



F 0 — tl = - 3 - (3-28) 

nMS E 2 cf 


as an F statistic for testing Equation 3-25. The null hypothesis would be rejected if 
Fo > F a ,\,N-a- We can write the test statistic of Equation 3-28 as 

_ MS C _ SS c n 
0 ms e ms e 


where the single degree of freedom contrast sum of squares is 
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Confidence Interval for a Contrast 

Instead of testing hypotheses about a contrast, it may be more useful to construct a 
confidence interval. The contrast is then often written in terms of treatment averages y f . 
Suppose that the contrast of interest is 


r = 2 ^ 

Replacing the treatment means with the treatment averages yields 

c = 2 c Ji. 
and 




when the sample sizes are equal. If we use MS E to estimate a 2 , the 100(1 - a) percent 
confidence interval on the contrast 2f =1 c,/x,- is 


2 c -y<. - tar. 


I MSe 2 


= 2 Cilli =£ 2 c,y L + tanji-t 


zz£ V 
n h " 


Clearly if this confidence interval includes zero, we would be unable to reject the null 
hypothesis in Equation 3-25. 


Standardized Contrast 

When more than one contrast is of interest, it is often useful to evaluate them on the 
same scale. One way to do this is to standardize the contrast so that it has variance o 2 . 
If the cont rast 2? =1 c,ja, is written in terms of treatment totals as 2f =1 c,y,-., dividing it by 
V«2“ =1 cf will produce a standardized contrast with variance a 2 . Effectively, then, the 

standardized contrast is 


where 


2 chi. 



Unequal Sample Sizes 

When the sample sizes in each treatment are different, minor modifications are made in 
the above results. First, note that the definition of a contrast now requires that 
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Other required changes are straightforward. For example, the t statistic in Equation 3-27 
becomes 


2 C ‘ yi - 

JmS e | n,cj 

and the contrast sum of squares from Equation 3-29 becomes 



3«5.5 Orthogonal Contrasts 

A useful special case of the procedure in Section 3-5.4 is that of orthogonal contrasts. 
Two contrasts with coefficients {c, } and {</, } are orthogonal if 

2 c,d, = 0 

or, for an unbalanced design, if 

2 nfiidi = 0 

For a treatments the set of a - 1 orthogonal contrasts partition the sum of squares due 
to treatments into a - 1 independent single-degree-of-freedom components. Thus, tests 
performed on orthogonal contrasts are independent. 

There are many ways to choose the orthogonal contrast coefficients for a set of 
treatments. Usually, something in the nature of the experiment should suggest which 
comparisons will be of interest. For example, if there are a = 3 treatments, with treatment 
1 a control and treatments 2 and 3 actual levels of the factor of interest to the experi¬ 
menter, appropriate orthogonal contrasts might be as follows: 


Treatment 

Coefficients for 
Orthogonal Contrasts 

1 (control) 

-2 0 

2 (level 1) 

1 -1 

3 (level 2) 

1 1 


Note that contrast 1 with c, = -2, 1, 1 compares the average effect of the factor with 
the control, whereas contrast 2 with d t = 0, — 1, 1 compares the two levels of the factor 
of interest. 

Generally, the method of contrasts (or orthogonal contrasts) is useful for what are 
called preplanned comparisons. That is, the contrasts are specified prior to running the 
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experiment and examining the data. The reason for this is that, if comparisons are selected 
after examining the data, most experimenters would construct tests that correspond to 
large observed differences in means. These large differences could be the result of the 
presence of real effects or they could be the result of random error. If experimenters 
consistently pick the largest differences to compare, they will inflate the type I error of 
the test because it is likely that, in an unusually high percentage of the comparisons 
selected, the observed differences will be the result of error. Examining the data to select 
comparisons of potential interest is often called data snooping. The Scheffe method for 
all comparisons, discussed in the next section, permits data snooping. 

EXAMPLE 3-6 . 

Consider the data in Example 3-1. There are five treatment means and four degrees of 
freedom between these treatments. Suppose that prior to running the experiment the 
following set of comparisons among the treatment means (and their associated contrasts) 
were specified: 


Hypothesis 

H 0 :/x 4 = fx 5 

Ho-l*- 1 + M'S = M* + Ms 

Ho'./Xi = il 3 

Hq-A/Jv = Mi + Ms + Mt + Ms 


Contrast 

Cl = - y 4 . + ys. 

C 2 = yi. + y 3 . - y*. - ys. 

C 3 = yi. ~ y 3 . 

c 4 = -yi. + 4 y 2 . - y 3 . - y 4 . - y 5 . 


Notice that the contrast coefficients are orthogonal. Using the data in Table 3-4, we find 
the numerical values of the contrasts and the sums of squares to be as follows: 


Ci 

C 2 

c 3 

c 4 


- 1(108) + 1(54) = -54 SS Cl 

+ 1(49) + 1(88) - 1(108) - 1(54) = -25 SS Cl 

+ 1(49) - 1(88) = -39 SSc 3 

-1(49) + 4(77) - 1(88) - 1(108) - 1(54) = 9 SS C4 


(~54) 2 
5(2) 
(~25) 2 
5(4) 
(—39) 2 
5(2) 
(9) 2 
5(20) 


= 291.60 
= 31.25 
= 152.10 
0.81 


Table 3-11 Analysis of Variance for the Tensile Strength Data 


Source of Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

Co 

P-Value 

Cotton Weight Percentage 

475.76 

4 

118.94 

14.76 

<0.001 

orthogonal contrasts 






C,:/x 4 = Ms 

(291.60) 

1 

291.60 

36.18 

<0.001 

C 2 :/Xi + fjLj = + fi s 

(31.25) 

1 

31.25 

3.88 

0.06 

C 3 --H 1 = Ms 

(152.10) 

1 

152.10 

18.87 

<0.001 

C 4 :4/u, 2 = Mi + Ms + M 4 + Ms 

(0.81) 

1 

0.81 

0.10 

0.76 

Error 

161.20 

20 

8.06 



Total 

636.96 

24 
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These contrast sums of squares completely partition the treatment sum of squares. The 
tests on such orthogonal contrasts are usually incorporated in the analysis of variance, 
as shown in Table 3-11. We conclude from the P-values that there are significant dif¬ 
ferences between levels 4 and 5 and 1 and 3 of cotton weight percent but that the average 
of levels 1 and 3 does not differ from the average of levels 4 and 5 at the a = 0.05 level, 
and that level 2 does not differ from the average of the other four levels. 


3-5.6 Scheffe’s Method for Comparing All Contrasts 

In many situations, experimenters may not know in advance which contrasts they wish 
to compare, or they may be interested in more than a — 1 possible comparisons. In many 
exploratory experiments, the comparisons of interest are discovered only after prelimi¬ 
nary examination of the data. Scheffe (1953) has proposed a method for comparing any 
and all possible contrasts between treatment means. In the Scheffe method, the type I 
error is at most a for any of the possible comparisons. 

Suppose that a set of m contrasts in the treatment means 

r„ = Ciu/Xi + c 2 u/J<i + • • • + c au fx a u = 1, 2,..., m (3-31) 

of interest have been determined. The corresponding contrast in the treatment averages 

y,. is 

C u = c lu y u + c 2 „y 2 . + • • • + c au y a , u = 1, 2 m (3-32) 

and the standard error of this contrast is 



where n, is the number of observations in the ith treatment. It can be shown that the 
critical value against which C„ should be compared is 

So* = S c „V(a - 1)F«,(3-34) 
To test the hypothesis that the contrast differs significantly from zero, refer C„ to the 
critical value. If | C u | > S aM , the hypothesis that the contrast T u equals zero is rejected. 

The Scheffe procedure can also be used to form confidence intervals for all possible 
contrasts among treatment means. The resulting intervals, say C„ — S a>u =£ T„ =£ C„ + 
S a>u , are simultaneous confidence intervals in that the probability is at least 1 - a that 
all of them are simultaneously true. 

To illustrate the procedure, consider the data in Example 3-1 and suppose that the 
contrasts of interests are 

Ti = /r, + /x 3 - yx 4 - fi 5 

and 

r 2 = ju.i - fi 4 

The numerical values of these contrasts are 

Ci = fi. + y 3 . - %. ~ % 

= 9.80 + 17.60 - 21.60 - 10.80 
= 5.00 
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and 


c 2 = yu - y*. 

= 9.80 - 21.60 
= -11.80 

and the standard errors are found from Equation 3-33 as 


S c , = JmS e X (cfJnd = V8.06(l + 1 + 1 + l)/5) = 2.54 
and 



From Equation 3-34, the 1 percent critical values are 

So.oi,i = S c ,V(a - l)F 0 . 0Ua . LN . a = 2.54V4(443) = 10.69 
and 

So. 01,2 = Sc 2 S/ (a — l)Fo.oi,a-i,v-a = 1.80V4(4.43) = 7.58 

Because |C t | < S 0 .oi,i> we conclude that the contrast V { ^ - /jl 4 - /a 5 equals 

zero; that is, there is no strong evidence to conclude that the means of treatments 1 and 
3 as a group differ from the means of treatments 4 and 5 as a group. However, because 
| C 2 1 > S 0 .oi, 2 : we conclude that the contrast T 2 = fii - /a 4 does not equal zero; that is, 
the mean strengths of treatments 1 and 4 differ significantly. 

In many practical situations, we will wish to compare only pairs of means. Fre¬ 
quently, we can determine which means differ by testing the differences between all 
pairs of treatment means. Thus, we are interested in contrasts of the form r = /a, — jXj 
for all i + j. Although the Scheffe method could be easily applied to this problem, it is 
not the most sensitive procedure for such comparisons. We now turn to a consideration 
of methods specifically designed for pairwise comparisons between all a population 
means. 


3-5.7 Comparing Pairs of Treatment Means 

Suppose that we are interested in comparing all pairs of a treatment means and that the 
null hypotheses that we wish to test are H 0 : /a, = /a, for all i =£ j. We now present four 
methods for making such comparisons. 

Tukey’s Test 

Suppose that, following an analysis of variance in which we have rejected the null hy¬ 
pothesis of equal treatment means, we wish to test all pairwise mean comparisons: 

Ho : / a , = 

Hi - n, * /a j 

for all i j. Tukey (1953) proposed a procedure for testing hypotheses for which the 
overall significance level is exactly a when the sample sizes are equal and at most a 
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when the sample sizes are unequal. His procedure can also be used to contract confidence 
intervals on the differences in all pairs of means. For these intervals, the simultaneous 
confidence level is 100(1 — a) percent when the sample sizes are equal and at least 
100(1 - a) percent when sample sizes are unequal. This is an excellent data snooping 
procedure when interest focuses on pairs of means. 

Tukey’s procedure makes use of the distribution of the studentized range statistic 



where y max and y min are the largest and smallest sample means, respectively, out of a 
group of p sample means. Appendix Table VIII contains values of q a (p,f), the upper a 
percentage points of q where/is the number of degrees of freedom associated with the 
MS e . For equal sample sizes, Tukey’s test declares two means significantly different if 
the absolute value of their sample differences exceeds 

T a = q a (a,f)J ^ (3-35) 

Equivalently, we could construct a set of 100(1 - a) percent confidence intervals for all 
pairs of means as follows: 


yt. - yj. - <7«(a,/) /— « - hj 


yu - yj. + <■. 

When sample sizes are not equal, Equations 3-35 and 3-36 become 






(3-36) 


(3-37) 


q a (gj) 

V2 


'MS E i- + —) *s /i, - Pj 


< 3 - 38 > 


respectively. The unequal sample size version is sometimes called the Tukey-Kramer 
procedure. 


EXAMPLE 3-7 . 

To illustrate Tukey’s test, we use the data from the cotton weight percentage experiment 
in Example 3-1. With a = 0.05 and/= 20 degrees of freedom for error. Appendix Table 
VIII gives < 70 . 05 ( 5 , 20) = 4.23. Therefore, from Equation 3-35, 


7*0.05 = <70.05(5, 20 ) = 4.23 = 5.37 
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.H. y 5 . y 3 . y*. 

9.8 10.8 15.4 17.6 21.6 


Figure 3-12 Results of Tukey’s test. 


Thus, any pair of treatment averages that differ in absolute value by more than 5.37 
would imply that the corresponding pair of population means are significantly different. 
The five treatment averages are 

y u = 9.8 y 2 . = 15.4 y 3 . = 17.6 y 4 . = 21.6 y 5 . = 10.8 

and the differences in averages are 


yi. 

- yi. 

= 

9.8 - 

15.4 = 

-5.6* 

?i. 

- h. 

= 

9.8 - 

17.6 = 

-7.8* 

yu 

- %. 

= 

9.8 - 

21.6 = 

-11.8* 

yu 

- ys. 

= 

9.8 - 

10.8 = 

-1.0 

yi. 

- ys. 

= 

15.4 - 

17.6 = 

-2.2 

yi. 

- %. 

= 

15.4 - 

21.6 = 

-6.2* 

yi. 

- ys. 

= 

15.4 - 

10.8 = 

4.6 

h. 

- %. 


17.6 - 

21.6 = 

-4.0 

)>3. 

- ys. 

= 

17.6 - 

10.8 = 

6.8* 


- ys. 

= 

21.6 - 

10.8 = 

10.8* 


The starred values indicate pairs of means that are significantly different. It is usually 
helpful to draw a graph, such as in Figure 3-12, underlining pairs of means that do not 
differ significantly. This graph gives an indication that the treatment means form three 
groups: fii and /i 5 , /li 2 and /a 3 , and /x 4 . However, the membership in these groups is not 
completely distinct. 


When using any procedure for pairwise testing of means we occasionally find that 
the overall F test from the analysis of variance is significant, but the pairwise comparison 
of means fails to reveal any significant differences. This situation occurs because the F 
test is simultaneously considering all possible contrasts involving the treatment means, 
not just pairwise comparisons. That is, in the data at hand, the significant contrasts may 
not be of the form /i, - fij. 

Some computer software packages present pairwise comparisons with confidence 
intervals. For the Tukey procedure, these intervals would be computed from either Equa¬ 
tion 3-36 or 3-38, depending on whether or not the sample sizes are equal. 

The derivation of the Tukey confidence interval of Equation 3-36 for equal sample 
sizes is straightforward. For the studentized range statistic q we have 

f /max0, - jQ - minty. - ^ A = , _ „ 

\ VMS E /n ) 
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If max(y,» - /x,) - min(y,. - /x,) is less than or equal to q a (a,fWMS E /n, it must be true 
that | (y L - /Aj) — (jj, - fLj) | =£ q u (a,f)\/MS F /n for every pair of means. Therefore 

P^-qJfl,f)J^. =£ y L - y h - (/x, - /x,) * 

Rearranging this expression to isolate /x, — /lx, between the inequalities will lead to the 
set of 100(1 - a) percent simultaneous confidence intervals given in Equation 3-38. 

The Fisher Least Significant Difference (LSD) Method 
This procedure uses the F statistic for testing H 0 : /u,, = /lx, 


pair of means /lx, 
,_ a VM5 £ (l/n, + 


Assuming a two-sided alternative, the pair of means /lx, and /u., would be declared sig¬ 
nificantly different if \y L - y,. | > t a / 2 ,v-aVMS £ (l/n, + l/n y ). The quantity 


is called the least significant difference. If the design is balanced, 
n a = n, and 


LSD = t a/ 


(3-41) 


To use the Fisher LSD procedure, we simply compare the observed difference be¬ 
tween each pair of averages to the corresponding LSD. If |y,. — y,. | > LSD, we conclude 
that the population means /u, and /x, differ. 

EXAMPLE 3-8 . 

To illustrate the procedure, if we use the data from the experiment in Example 3-1, the 
LSD at a = 0.05 is 

LSD = t Q2 

\ n y D 

Thus, any pair of treatment averages that differ in absolute value by more than 3.75 
would imply that the corresponding pair of population means are significantly different. 
The differences in averages are 


K 

- yi. 

= 

9.8 - 

15.4 - 

-5.6* 

yi. 

- 5*3. 

= 

9.8 - 

17.6 = 

-7.8* 

yu 

n. 

= 

9.8 - 

21.6 = 

-11.8* 

yi. 

- ys. 

= 

9.8 - 

10.8 = 

-1.0 

5*2. 

- h. 

= 

15.4 - 

17.6 = 

-2.2 

yi. 

~ 5*4. 

= 

15.4 - 

21.6 = 

-6.2* 

yi. 

- ^5. 

= 

15.4 - 

10.8 = 

4.6* 

yi. 

- 5*4. 

= 

17.6 - 

21.6 = 

-4.0* 

5*3. 

- 5*5. 

= 

17.6 - 

10.8 = 

6.8* 

^4. 

5*5. 

= 

21.6 - 

10.8 = 

10.8* 
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y 6 . X 2 . 3-3. y t . 

9.8 10.8 15.4 17.6 21.6 


Figure 3-13 Results of the LSD procedure. 


The starred values indicate pairs of means that are significantly different. Figure 3-13 
summarizes the results. Clearly, the only pairs of means that do not differ significantly 
are 1 and 5 and 2 and 3, and treatment 4 produces a significantly greater tensile strength 
than the other treatments. 


Note that the overall a risk may be considerably inflated using this method. Specif¬ 
ically, as a gets larger, the type I error of the experiment (the ratio of the number of 
experiments in which at least one type I error is made to the total number of experiments) 
becomes large. 

Duncan’s Multiple Range Test 

A widely used procedure for comparing all pairs of means is the multiple range test 
developed by Duncan (1955). To apply Duncan’s multiple range test for equal sample 
sizes, the a treatment averages are arranged in ascending order, and the standard error 
of each average is determined as 



For unequal sample sizes, replace n in Equation 3-42 by the harmonic mean n h of the 
{«,}, where 


n k = -a- 1 - (3-43) 

2 dM) 

Note that if n x = n 2 = • • • = n a , n h = n. From Duncan’s table of significant ranges 
(Appendix Table VII), obtain the values r a (p, /) for p = 2, 3,. .., a, where a is the 
significance level and / is the number of degrees of freedom for error. Convert these 
ranges into a set of a — 1 least significant ranges (e.g., R p ) for p = 2, 3,..., a by 
calculating 

R P = r a (p, f)Sy L for p = 2, 3,..., a (3-44) 

Then, the observed differences between means are tested, beginning with largest versus 
smallest, which would be compared with the least significant range R a . Next, the differ¬ 
ence of the largest and the second-smallest is computed and compared with the least 
significant range R a ,. These comparisons are continued until all means have been com¬ 
pared with the largest mean. Finally, the difference between the second-largest mean and 
the smallest is computed and compared against the least significant range R a _ t . This 
process is continued until the differences between all possible a(a - l)/2 pairs of means 
have been considered. If an observed difference is greater than the corresponding least 
significant range, we conclude that the pair of means in question is significantly different. 
To prevent contradictions, no differences between a pair of means are considered sig- 
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nificant if the two means involved fall between two other means that do not differ 
significantly. 

EXAMPLE 3-9 . 

We can apply Duncan’s multiple range test to the experiment of Example 3-1. Recall 
that MS e = 8.06, IV = 25, n = 5, and there are 20 error degrees of freedom. Ranking 
the treatment averages in ascending order, we have 

yu= 9.8 
y 5 _ = 10.8 
y 2 . = 15.4 
y 3 . = 17.6 
%. = 21.6 

The standard error of each average is S y . = V8.06/5 = 1.27. From the table of signifi¬ 
cant ranges in Appendix Table VII for 20 degrees of freedom and a = 0.05, we obtain 
>0.05(2, 20) = 2.95, r 005 (3, 20) = 3.10, r 0 . 05 (4, 20) = 3.18, and r 0 . 05 (5, 20) = 3.25. 
Thus, the least significant ranges are 

R 2 = r 0 .o 5 (2, 20)Sy. = (2.95X1.27) = 3.75 
R s = r 005 (3, 20 )Sy. = (3.10X1.27) = 3.94 
R 4 = > 0 . 05 ( 4 , 20)Sy t = (3.18X1.27) = 4.04 
Rs = >-0.05(5, 20)Sy. = (3.25)(1.27) = 4.13 
The comparisons would yield 

4 vs. 1: 21.6 - 9.8 = 11.8 > 4.13(/? 5 ) 

4 vs. 5: 21.6 - 10.8 = 10.8 > 4.04 (R 4 ) 

4 vs. 2: 21.6 - 15.4 = 6.2 > 3.94 (R 3 ) 

4 vs. 3: 21.6 - 17.6 = 4.0 > 3.75 (R 2 ) 

3 vs. 1: 17.6 - 9.8 = 7.8 > 4.04(R 4 ) 

3 vs. 5: 17.6 - 10.8 = 6.8 > 3.95(R 3 ) 

3 vs. 2: 17.6 - 15.4 = 2.2 < 3.75 (R 2 ) 

2 vs. 1: 15.4 - 9.8 = 5.6 > 3.94(/? 3 ) 

2 vs. 5: 15.4 - 10.8 = 4.6 > 3.75 (R 2 ) 

5 vs. 1: 10.8 - 9.8 = 1.0 < 3.75(R 2 ) 

From the analysis we see that there are significant differences between all pairs of means 
except 3 and 2 and 5 and 1. A graph underlying those means that are not significantly 
different is shown in Figure 3-14. Notice that, in this example, Duncan’s multiple range 
test and the LSD method produce identical conclusions. 


fu y 5 . y 2 . >3. y 4 . 

9.8 10.8 15.4 17.6 21.£ 

Figure 3-14 


Results of Duncan’s multiple range 
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Duncan’s test requires a greater observed difference to detect significantly different 
pairs of means as the number of means included in the group increases. For instance, in 
the above example R 2 = 3.75 (two means) whereas R 3 = 3.94 (three means). For two 
means, the critical value R 2 will always exactly equal the LSD value from the t test. The 
values r a (p,f) in Appendix Table VII are chosen so that a specified protection level is 
obtained. That is, when two means that are p steps apart are compared, the protection 
level is (1 - a) p 1 , where a is the level of significance specified for two adjacent means. 
Thus, the error rate is 1 - (1 - a) p ~ l of reporting at least one incorrect significant 
difference between two means when the group size is p. For example, if a = 0.05, then 
1 - (1 - 0.05) 1 = 0.05 is the significance level for comparing the adjacent pair of 
means, 1 - (1 - 0.05) 2 = .10 is the significance level for means that are one step apart, 
and so forth. 

Generally, if the protection level is a, tests on means have a significance level that 
is greater than or equal to a. Consequently, Duncan’s procedure is quite powerful; that 
is, it is very effective at detecting differences between means when real differences exist. 
For this reason, Duncan’s multiple range test is very popular. 

The Newman-Keuls Test 

This test was devised by Newman (1939). Because new interest in Newman’s test was 
generated by Keuls (1952), the procedure is usually called the Newman-Keuls test. Op¬ 
erationally, the procedure is similar to Duncan’s multiple range test, except that the 
critical differences between means are calculated somewhat differently. Specifically, we 
compute a set of critical values 

K p = UP, f)Sy L p = 2, 3,.... a (3-45) 

where q a (p, f ) is the upper a percentage point of the studentized range for groups of 
means of size p and / error degrees of freedom. Once the values K p are computed from 
Equation 3-45, extreme pairs of means in groups of size p are compared with K p exactly 
as in Duncan’s multiple range test. 

Which Pairwise Comparison Method Do I Use? 

Certainly a logical question at this point is Which one of these procedures should I use? 
Unfortunately, there is no clear-cut answer to this question, and professional statisticians 
often disagree over the utility of the various procedures. Carmer and Swanson (1973) have 
conducted Monte Carlo simulation studies of a number of multiple comparison procedures, 
including others not discussed here. They report that the least significant difference method 
is a very effective test for detecting true differences in means if it is applied only after 
the F test in the analysis of variance is significant at 5 percent. They also report good 
performance in detecting true differences with Duncan’s multiple range test. This is not 
surprising, because these two methods are the most powerful of those we have discussed. 
However, these methods do not contain the experimentwise error rate. Because the Tukey 
method does control the overall error rate, many statisticians prefer to use it. 

The Newman-Keuls test is more conservative than Duncan’s test in that the type I 
error rate is smaller. Specifically, the type I error of the experiment is a for all tests 
involving the same number of means. Consequently, because a is generally lower, the 
power of the Newman-Keuls test is generally less than that of Duncan’s test. To dem¬ 
onstrate that the Newman-Keuls procedure leads to a less powerful test than Duncan’s 
multiple range test, we note from a comparison of Tables VII and VIII of the Appendix 
that for p > 2 we have q a (p,f) > r a (p,f). That is, it is “harder” to declare a pair of 
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means significantly different using the Newman-Keuls test than it is using Duncan’s 
procedure. This is illustrated below for the case a = 0.01, a = 8, and / = 20: 


p 

2 

3 

4 

5 

6 

7 

8 

r om (p, 20) 

4.02 

4.22 

4.33 

4.40 

4.47 

4.53 

4.58 

<lo.oi(P- 20) 

4.02 

4.64 

5.02 

5.29 

5.51 

5.69 

5.84 


As indicated above, there are several other multiple comparison procedures. For 
articles describing these methods, see Miller (1977), O’Neill and Wetherill (1971), and 
Nelson (1989). The book by Miller (1991) is also recommended. 


3-5.8 Comparing Treatment Means with a Control 


In many experiments, one of the treatments is a control, and the analyst is interested in 
comparing each of the other a - 1 treatment means with the control. Thus, there are 
only a — 1 comparisons to be made. A procedure for making these comparisons has been 
developed by Dunnett (1964). Suppose that treatment a is the control and we wish to 
test the hypotheses 

Ho -Hi = Ha 

Hi: /Jbj =£ fi a 

for i = 1, 2 a — 1. Dunnett’s procedure is a modification of the usual i-test. For 
each hypothesis, we compute the observed differences in the sample means 
Itt. “5U i = 1, 2, ...,a - 1 

The null hypothesis H 0 : /r, = fi a is rejected using a type I error rate a if 

I % “ %.\ > d a (a - 1, /) iWQ- + (3-46) 

where the constant d a (a - 1,/) is given in Appendix Table IX. (Both two- and one¬ 
sided tests are possible.) Note that a is the joint significance level associated with all 
a - 1 tests. 


EXAMPLE 3-10 . 

To illustrate Dunnett’s test, consider the experiment from Example 3-1 with treatment 5 
considered the control. In this example, a = 5, a — 1 = 4,/ = 20, and n, — n = 5. At 
the 5 percent level, we find from Appendix Table IX that d 0 05 (4, 20) = 2.65. Thus, the 
critical difference becomes 


= 4.76 


(Note that this is a simplification of Equation 3-46 resulting from a balanced design.) 
Thus, any treatment mean that differs from the control by more than 4.76 would be 
declared significantly different. The observed differences are 


1 vs. 5: y u - y 5 . = 9.8 - 10.8 = -1.0 

2 vs. 5: y 2 . ~ y 5 . = 15.4 - 10.8 = 4.6 

3 vs. 5: y 3 . - y 5 . = 17.6 - 10.8 = 6.8 

4 vs. 5: y 4 . — y 5 . = 21.6 — 10.8 = 10.8 
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Only the differences y 3 , - y 5 _ and y 4 . - y 5 . indicate any significant difference when com¬ 
pared to the control; thus, we conclude that /r 3 A (jl 5 and p 4 A p 5 . 

When comparing treatments with a control, it is a good idea to use more observations 
for the control treatment (say n a ) than for the other treatments (say n), assuming equal 
numbers of observations for the remaining a - 1 treatments. The ratio njn should be 
chosen to be approximately equal to the square root of the total number of treatments. 
That is, choose njn = Va. 


3-6 SAMPLE COMPUTER OUTPUT 


Computer programs for supporting experimental design and performing the analysis 
of variance are widely available. The output from one such program, Design-Expert, 
is shown in Figure 3-15 (on the facing page), using the data from the single-factor 
experiment in Example 3-1. The sum of squares corresponding to the “Model” is the 
usual SS-rreatments for a single-factor design. That source is further identified as “A. ” When 
there is more than one factor in the experiment, the model sum of squares will be de¬ 
composed into several sources (A, B, etc.). Notice that the analysis of variance summary 
at the top of the computer output contains the usual sums of squares, degrees of freedom, 
mean squares, and test statistic F 0 . The column “Prob > F” is the P-value (actually, the 
upper bound on the P-value, because probabilities less than 0.0001 are defaulted to 
0.0001). 

In addition to the basic analysis of variance, the program displays some additional 
useful information. The quantity “R-squared” is defined as 


R 2 



475.76 

636.96 


= 0.746923 


and is loosely interpreted as the proportion of the variability in the data “explained” by 
the analysis of variance model. Thus, in the synthetic fiber strength testing data, the 
factor “cotton weight percentage” explains about 74.69 percent of the variability in 
tensile strength. Clearly, we must have 0 =£ R 2 *£ 1, with larger values being more 
desirable. There are also some other P 2 -like statistics displayed in the output. The “ad¬ 
justed” R 2 is a variation of the ordinary R 2 statistic that reflects the number of factors in 
the model. It can be a useful statistic for more complex experiments with several design 
factors when we wish to evaluate the impact of increasing or decreasin g the n umber of 
model terms. “Std.Dev.” is the square root of the error mean square, V8.060 = 2.839, 
and “C.V.” is the coefficient of variation, defined as (VM.S,,/y)100. The coefficient of 
variation measures the unexplained or residual variability in the data as a percentage 
of the mean of the response variable. “PRESS” stands for “Prediction Error Sum of 
Squares,” and it is a measure of how well the model for the experiment is likely to 
predict the responses in a new experiment. Small values of PRESS are desirable. Alter¬ 
natively, one can calculate an R 2 for predictions based on PRESS (we will show how to 
do this later). This R j rcd in our problem is 0.6046, which is not unreasonable, considering 
that the model accounts for about 75 percent of the variability in the current experiment. 
The “Adequate Precision” statistic is computed by dividing the difference between the 
maximum predicted response and the minimum predicted response by the average stan¬ 
dard deviation of all predicted responses. Large values of this quantity are desirable, and 
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Response: Strength in psi 

ANOVA for Selected Factorial Model 
Analysis of variance table [Partial sum of squares] 


Sum of 

Source Squares 

Model 475.76 

A 475.76 

Residual 161.20 

Lack of Fit 0.000 

Pure Error 161.20 

Cor Total 636.96 


Mean F 

DF Square Value 

4 118.94 14.76 

4 118.94 14.76 

20 8.06 

0 

20 8.06 

24 


Prob > F 

<0.0001 significant 
< 0.0001 


The Model F-value of 14.76 implies the model is significant. There is only a 0.01% chance 
that a "Model F-Value" this large could occur due to noise. 


Values of "Prob > F" less than 0.0500 indicate model terms are significant. 

In this case A are significant model terms. 

Values greater than 0.1000 indicate the model terms are not significant. 

If there are many insignificant model terms (not counting those required to support hi- 


erarchy). 

model reduction may improve your model. 


Std. Dev. 

2.84 

R-Squared 

0.7469 

Mean 

15.04 

Adj R-Squared 

0.6963 

C.V. 

18.88 

Pred R-Squared 

0.6046 

PRESS 

251.88 

Adeq Precision 

9.294 


The "Pred R-Squared" of 0.6046 is in reasonable agreement with the "Adj R-Squared" of 
0.6963. A difference greater than 0.20 between the "Pred R-Squared" and the "Adj R- 
Squared" indicates a possible problem with your model and/or data. 

"Adeq Precision" measures the signal to noise ratio. A ratio greater than 4 is desirable. 
Your ratio of 9.294 indicates an adequate signal to use this model to navigate the design 
space. 

Treatment Means (Adjusted, If Necessary) 



Estimated 

Standard 




Mean 

Error 



1-15 

9.80 

1.27 



2-20 

15.40 

1.27 



3-25 

17.60 

1.27 



4-30 

21.60 

1.27 



5-35 

10.80 

1.27 




Mean 

Standard 

t for H 0 


Treatment 

Difference 

DF Error 

Coeff=0 

Prob > t 

1 vs 2 

-5.60 

1 1.80 

-3.12 

0.0054 

1 vs 3 

-7.80 

1 1.80 

-4.34 

0.0003 

1 vs 4 

-11.80 

1 1.80 

-6.57 

<0.0001 

1 vs 5 

-1.00 

1 1.80 

-0.56 

0.5838 

2 vs 3 

-2.20 

1 1.80 

-1.23 

0.2347 

2 vs 4 

-6.20 

1 1.80 

-3.45 

0.0025 

2 vs 5 

4.60 

1 1.80 

2.56 

0.0186 

3 vs 4 

-4.00 

1 1.80 

-2.23 

0.0375 

3 vs 5 

6.80 

1 1.80 

3.79 

0.0012 

4 vs 5 

10.80 

1 1.80 

6.01 

<0.0001 

Values of "Prob 

> 11" less than 0.0500 indicate the difference in the two treatment means 


is significant. 

Values of "Prob > |t|" greater than 0.1000 indicate the difference in the two treatment 
means is not significant. 

Figure 3-15 Design-Expert computer output for Example 3-1. 
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Diagnostic Case Statistics 
Standard Actual Predicted 
Order Value Value 

Residual 

Leverage 

Student 

Residual 

Cook's 

Distance 

Outlier t 

1 

7.00 

9.80 

-2.80 

0.200 

-1.103 

0.061 

-1.109 

2 

7.00 

9.80 

-2.80 

0.200 

-1.103 

0.061 

-1.109 

3 

15.00 

9.80 

5.20 

0.200 

2.048 

0.210 

2.245 

4 

11.00 

9.80 

1.20 

0.200 

0.473 

0.011 

0.463 

5 

9.00 

9.80 

-0.80 

0.200 

-0.315 

0.005 

-0.308 

6 

12.00 

15.40 

-3.40 

0.200 

-1.339 

0.090 

-1.368 

7 

17.00 

15.40 

1.60 

0.200 

0.630 

0.020 

0.620 

8 

12.00 

15.40 

-3.40 

0.200 

-1.339 

0.090 

-1.368 

9 

18.00 

15.40 

2.60 

0.200 

1.024 

0.052 

1.025 

10 

18.00 

15.40 

2.60 

0.200 

1.024 

0.052 

1.025 

11 

14.00 

17.60 

-3.60 

0.200 

-1.418 

0.100 

-1.457 

12 

18.00 

17.60 

0.40 

0.200 

0.158 

0.001 

0.154 

13 

18.00 

17.60 

0.40 

0.200 

0.158 

0.001 

0.154 

14 

19.00 

17.60 

1.40 

0.200 

0.551 

0.015 

0.542 

15 

19.00 

17.60 

1.40 

0.200 

0.551 

0.015 

0.542 

16 

19.00 

21.60 

-2.60 

0.200 

-1.024 

0.052 

-1.025 

17 

25.00 

21.60 

3.40 

0.200 

1.339 

0.090 

1.368 

18 

22.00 

21.60 

0.40 

0.200 

0.158 

0.001 

0.154 

19 

19.00 

21.60 

-2.60 

0.200 

-1.024 

0.052 

-1.025 

20 

23.00 

21.60 

1.40 

0.200 

0.551 

0.015 

0.542 

21 

7.00 

10.80 

-3.80 

0.200 

-1.496 

0.112 

-1.548 

22 

10.00 

10.80 

-0.80 

0.200 

-0.315 

0.005 

-0.308 

23 

11.00 

10.80 

0.20 

0.200 

0.079 

0.000 

0.077 

24 

15.00 

10.80 

4.20 

0.200 

1.654 

0.137 

1.735 

25 

11.00 

10.80 

0.20 

0.200 

0.079 

0.000 

0.077 


Proceed to Diagnostic Plots (the next icon in progression). Be sure to look at the: 

1) Normal probability plot of the studentized residuals to check for normality of 
residuals. 

2) Studentized residuals versus predicted values to check for constant error. 

3) Outlier t versus run order to look for outliers, i.e., influential values. 

4) Box-Cox plot for power transformations. 

If all the model statistics and diagnostic plots are OK, finish up with the Model Graphs 
icon. 

Figure 3-15 (Continued) 


values that exceed four usually indicate that the model will give reasonable performance 
in prediction. 

Treatment means are e stimate d and the standard error (or sample standard deviation 
of each treatment mean, \/MSjn) is displayed. Differences between pairs of treatment 
means are investigated by using the Fisher LSD method described in Section 3-5.7. 

The computer program also calculates and displays the residuals, as defined in Equa¬ 
tion 3-16. The program will also produce all of the residual plots that we discussed in 
Section 3-4. There are also several other residual diagnostics displayed in the output. 
Some of these will be discussed later. 

Finally, notice that the computer program also has some interpretative guidance 
embedded in the output. This “advisory” information is fairly standard in many PC- 
based statistics packages. Remember in reading such guidance that it is written in very 
general terms and may not exactly suit the report writing requirements of any specific 
experimenter. This advisory output may be deleted by the user. 
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3-7 DETERMINING SAMPLE SIZE 

In any experimental design problem, a critical decision is the choice of sample size— 
that is, determining the number of replicates to run. Generally, if the experimenter is 
interested in detecting small effects, more replicates are required than if the experimenter 
is interested in detecting large effects. In this section, we discuss several approaches to 
determining sample size. Although our discussion focuses on a single-factor design, most 
of the methods can be used in more complex experimental situations. 


3»7.1 Operating Characteristic Curves 

Recall that an operating characteristic curve is a plot of the type II error probability 
of a statistical test for a particular sample size versus a parameter that reflects the extent 
to which the null hypothesis is false. These curves can be used to guide the experimenter 
in selecting the number of replicates so that the design will be sensitive to important 
potential differences in the treatments. 

We consider the probability of type II error of the fixed effects model for the case 
of equal sample sizes per treatment, say 

>3=1— P{Reject H 0 \H 0 is false} 

= 1 - P{F 0 > F a , a -i,N- a \Ho is false} (3-47) 

To evaluate the probability statement in Equation 3-47, we need to know the distribution 
of the test statistic F 0 if the null hypothesis is false. It can be shown that, if H 0 is false, 
the statistic F 0 = MS TieMments /MS E is distributed as a noncentral F random variable with 
a - 1 and N - a degrees of freedom and the noncentrality parameter 8. If 8 = 0, the 
noncentral F distribution becomes the usual (central) F distribution. 

Operating characteristic curves given in Chart V of the Appendix are used to evaluate 
the probability statement in Equation 3-47. These curves plot the probability of type II 
error (j8) against a parameter d>, where 


n jS T ? 

4> 2 = ' =1 2 (3-48) 

aa 

The quantity $ 2 is related to the noncentrality parameter 8. Curves are available for 
a = 0.05 and a = 0.01 and a range of degrees of freedom for numerator and denominator. 

In using the operating characteristic curves, the experimenter must specify the pa¬ 
rameter O. This is often difficult to do in practice. One way to determine is to choose 
the actual values of the treatment means for which we would like to reject the null 
hypothesis with high probability. Thus, if fi u ..., /x a are the specified treatment 
means, we find the t, in Equation 3-48 as t, = /a, — JL, where Ji = ( l/a)2“ -, /a, is the 
average of the individual treatment means. We also require an estimate of a 2 . Sometimes 
this is available from prior experience, a previous experiment or a preliminary test (as 
suggested in Chapter 1), or a judgment estimate. When we are uncertain about the value 
of cr 2 , sample sizes could be determined for a range of likely values of a 2 to study the 
effect of this parameter on the required sample size before a final choice is made. 
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EXAMPLE 3-11 . 

Consider the tensile strength experiment described in Example 3-1. Suppose that the 
experimenter is interested in rejecting the null hypothesis with a probability of at least 
0.90 if the five treatment means are 

fii = 11 Ho.= 12 iu. 3 = 15 /r 4 = 18 and /jl 5 = 19 

She plans to use a = 0.01. In this case, because = 75, we have Ja = (1/5)75 = 

15 and 


T| = fjb\ /x = 11 15 = —4 

t 2 = )U2 — /I = 12 - 15 = —3 

t 3 = — jl = 15 — 15 = 0 

T 4 = /Li 4 - = 18 - 15 = 3 

t 5 = n 5 - /Z = 19 - 15 = 4 


Thus, 2f =1 rf = 50. Suppose the experimenter feels that the standard deviation of tensile 
strength at any particular level of cotton weight percentage will be no larger than a = 3 
psi. Then, by using Equation 3-48, we have 


<p 2 



l.lln 


We use the operating characteristic curve for a - 1 = 5 - 1 = 4 with N - a = 
a(n — 1) = 5(« — 1) error degrees of freedom and a = 0.01 (see Appendix Chart 
V). As a first guess at the required sample size, try n = 4 replicates. This yields 
O 2 = 1.11(4) = 4.44, $ = 2.11, and 5(3) = 15 error degrees of freedom. Consequently, 
from Chart V, we find that /3 = 0.30. Therefore, the power of the test is approximately 
1 — /3 = 1 — 0.30 = 0.70, which is less than the required 0.90, and so we conclude that 
n = 4 replicates are not sufficient. Proceeding in a similar manner, we can construct the 
following display: 


n 4> 

a{n - 1) 

P 

Power (1-/3) 

4 4.44 2.11 

15 

0.30 

0.70 

5 5.55 2.36 

20 

0.15 

0.85 

6 6.66 2.58 

25 

0.04 

0.96 

Thus, at least n = 6 replicates must be run to obtain a test with the required power. 


The only problem with this approach to using the operating characteristic curves is 
that it is usually difficult to select a set of treatment means on which the sample size 
decision should be based. An alternate approach is to select a sample size such that if 
the difference between any two treatment means exceeds a specified value the null hy¬ 
pothesis should be rejected. If the difference between any two treatment means is as large 
as D, it can be shown that the minimum value of <b 2 is 
nD 2 
2a (T 2 


<I > 2 


(3-49) 
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Because this is a minimum value of O 2 , the corresponding sample size obtained from 
the operating characteristic curve is a conservative value; that is, it provides a power at 
least as great as that specified by the experimenter. 

To illustrate this approach, suppose that in the tensile strength experiment from 
Example 3-1, the experimenter wished to reject the null hypothesis with probability at 
least 0.90 if any two treatment means differed by as much as 10 psi. Then, assuming that 
cr = 3 psi, we find the minimum value of <I> 2 to be 


*(10) 

2(5)(3 2 ) 


and, from the analysis in Example 3-11, we conclude that n = 6 replicates are required 
to give the desired sensitivity when a = 0.01. 


3-7.2 Specifying a Standard Deviation Increase 

This approach is occasionally helpful in choosing the sample size. If the treatment means 
do not differ, the standard deviation of an observation chosen at random is cr. If the 
treatment means are different, however, the standard deviation of a randomly chosen 
observation is 


J * 2 + (| T?/ ») 

If we choose a percentage P for the increase in the standard deviation of an observation 
beyond which we wish to reject the hypothesis that all treatment means are equal, this 
is equivalent to choosing 

ft + (i rf/aj 

-= 1 + 0.0 IP (P = percent) 


or 


so that 



V(1 + 0.01P) 2 - 1 



V(1 + 0.01P) 2 - ](Vn) 


(3-50) 


Thus, for a specified value of P, we may compute d> from Equation 3-50 and then use 
the operating characteristic curves in Appendix Chart V to determine the required sample 
size. 
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For example, in the tensile strength experiment from Example 3-1, suppose that we 
wish to detect a standard deviation increase of 20 percent with a probability of at least 
0.90 and a = 0.05. Then 

O = V(1.2) 2 - l(Vn) = 0.66 Vn 

Reference to the operating characteristic curves shows that n = 9 is required to give the 
desired sensitivity. 


3-7.3 Confidence Interval Estimation Method 


This approach assumes that the experimenter wishes to express the final results in terms 
of confidence intervals and is willing to specify in advance how wide he or she wants 
these confidence intervals to be. For example, suppose that in the tensile strength exper¬ 
iment from Example 3-1 we wanted a 95 percent confidence interval on the difference 
in mean tensile strength for any two cotton weight percentages to be ±5 psi and a prior 
estimate of cris 3. Then, using Equation 3-13, we find that the accuracy of the confidence 
interval is 



Suppose that we try n = 5 replicates. Then, using a 2 = 3 2 = 9 as an estimate of MS E , 
the accuracy of the confidence interval becomes 



= ±3.96 


which is more accurate than the requirement. Trying n = 4 gives 


Trying n = 3 gives 



±4.52 



±5.46 


Clearly n = 4 is the smallest sample size that will lead to the desired accuracy. 

The quoted level of significance in the above illustration applies only to one confi¬ 
dence interval. However, the same general approach can be used if the experimenter 
wishes to prespecify a set of confidence intervals about which a joint or simultaneous 
confidence statement is made (see the comments about simultaneous confidence inter¬ 
vals in Section 3-3.3). Furthermore, the confidence intervals could be constructed about 
more general contrasts in the treatment means than the pairwise comparison illustrated 
above. 


3-8 DISCOVERING DISPERSION EFFECTS 

We have focused on using the analysis of variance and related methods to determine 
which factor levels result in differences among treatment or factor level means. It is 
customary to refer to these effects as location effects. If there was inequality of variance 
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Table 3-12 Data for the Smelting Experiment 


Ratio 



Observations 



Algorithm 

1 

2 

3 

4 

5 

6 

1 

4.93(0.05) 

4.86(0.04) 

4.75(0.05) 

4.95(0.06) 

4.79(0.03) 

4.88(0.05) 

2 

4.85(0.04) 

4.91(0.02) 

4.79(0.03) 

4.85(0.05) 

4.75(0.03) 

4.85(0.02) 

3 

4.83(0.09) 

4.88(0.13) 

4.90(0.11) 

4.75(0.15) 

4.82(0.08) 

4.90(0.12) 

4 

4.89(0.03) 

4.77(0.04) 

4.94(0.05) 

4.86(0.05) 

4.79(0.03) 

4.76(0.02) 


at the different factor levels, we used transformations to stabilize the variance to improve 
our inference on the location effects. In some problems, however, we are interested in 
discovering whether the different factor levels affect variability; that is, we are interested 
in discovering potential dispersion effects. This will occur whenever the standard de¬ 
viation, variance, or some other measure of variability is used as a response variable. 

To illustrate these ideas, consider the data in Table 3-12, which resulted from a 
designed experiment in an aluminum smelter. Aluminum is produced by combining alu¬ 
mina with other ingredients in a reaction cell and applying heat by passing electric current 
through the cell. Alumina is added continuously to the cell to maintain the proper ratio 
of alumina to other ingredients. Four different ratio control algorithms were investigated 
in this experiment. The response variables studied were related to cell voltage. Specifi¬ 
cally, a sensor scans cell voltage several times each second, producing thousands of 
voltage measurements during each run of the experiment. The process engineers decided 
to use the average voltage and the standard deviation of cell voltage (shown in paren¬ 
theses) over the run as the response variables. The average voltage is important because 
it affects cell temperature, and the standard deviation of voltage (called “pot noise” by 
the process engineers) is important because it affects the overall cell efficiency. 

An analysis of variance was performed to determine if the different ratio control 
algorithms affect average cell voltage. This revealed that the ratio control algorithm had 
no location effect; that is, changing the ratio control algorithms does not change the 
average cell voltage. (Refer to Problem 3-28.) 

To investigate dispersion effects, it is usually best to use 
log(s) or logfs 2 ) 

as a response variable since the log transformation is effective in stabilizing variability 
in the distribution of the sample standard deviation. Because all sample standard devia¬ 
tions of pot voltage are less than unity, we will use 
y = -Infs') 

as the response variable. Table 3-13 presents the analysis of variance for this response, 
the natural logarithm of “pot noise’.” Notice that the choice of a ratio control algorithm 


Table 3-13 Analysis of Variance for the Natural Logarithm of Pot Noise 
Sum of Degrees of Mean 

Source of Variation Squares Freedom Square F 0 P-Value 

Ratio Control Algorithm 6.166 3 2.055 21.96 <0.001 

Error 1.S72 20 0.094 

Total 8.038 23 
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, 3_ j 1 4 2 _!_ 

2.00 3.00 4.00 

Average log pot noise [-In (s)l 

Figure 3-16 Average log pot noise [-In (s)] for four ratio control 
algorithms relative to a scaled t distribution with scale factor 
VmSJ, n = V0.094/6 = 0.125. 


affects pot noise; that is, the ratio control algorithm has a dispersion effect. Standard 
tests of model adequacy, including normal probability plots of the residuals, indicate that 
there are no problems with experimental validity. (Refer to Problem 3-29.) 

Figure 3-16 plots the average log pot noise for each ratio control algorithm and also 
presents a scaled t distribution for use as a reference distribution in discriminating 
between ratio control algorithms. This plot clearly reveals that ratio control algorithm 3 
produces greater pot noise or greater cell voltage standard deviation than the other al¬ 
gorithms. There does not seem to be much difference between algorithms 1, 2, and 4. 


3*9 THE REGRESSION APPROACH TO THE ANALYSIS 
OF VARIANCE 

We have given an intuitive of heuristic development of the analysis of variance. However, 
it is possible to give a more formal development. The method will be useful later in 
understanding the basis for the statistical analysis of more complex designs. Called the 
general regression significance test, the procedure essentially consists of finding the 
reduction in the total sum of squares for fitting the model with all parameters included 
and the reduction in sum of squares when the model is restricted to the null hypotheses. 
The difference between these two sums of squares is the treatment sum of squares with 
which a test of the null hypothesis can be conducted. The procedure requires the least 
squares estimators of the parameters in the analysis of variance model. We have given 
these parameter estimates previously (in Section 3-3.3); however, we now give a formal 
development. 


3-9.1 Least Squares Estimation of the Model Parameters 

We now develop estimators for the parameter in the single-factor model 

y,j = fL + Tj + e y 

using the method of least squares. To find the least squares estimators of yu, and r,, we 
first form the sum of squares of the errors 


l = i 2 4 = 2 2 (yij - v - t ,) 2 


( 3 - 51 ) 
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and then choose values of /x and t„ say fi and f„ that minimize L. The appropriate values 
would be the solutions to the a + 1 simultaneous equations 


dL_\ 

sl\ 


= 0 

= 0 


i = 1,2,..., a 


Differentiating Equation 3-51 with respect to fi and t, and equating to zero, we obtain 


-2 E E (y,j - fi - T,) = o 

and 


-2 E (y„ ~ fi - *,) = 0 i= 1,2,..., a 

which, after simplification, yield 

N(L + nf] + nf 2 + • ■ • + nf a = y_ 
nfi + nfi = y u 

nfi + n? 2 = y 2 . (3-52) 


nfi, + nf a = y a . 

The a + 1 equations (Equation 3-52) in a + 1 unknowns are called the least squares 
normal equations. Notice that if we add the last a normal equations, we obtain the first 
normal equation. Therefore, the normal equations are not linearly independent, and no 

unique solution for /x, Ti.r a exists. This difficulty can be overcome by several 

methods. Because we have defined the treatment effects as deviations from the overall 
mean, it seems reasonable to apply the constraint 

E fi = 0 (3-53) 

Using this constraint, we obtain as the solution to the normal equations 

fi = y.. (3-54) 

f, = y ; . — y_ i = 1, 2,..., a 

This solution is obviously not unique and depends on the constraint (Equation 3-53) 
that we have chosen. At first this may seem unfortunate because two different experi¬ 
menters could analyze the same data and obtain different results if they apply different 
constraints. However, certain functions of the model parameter are uniquely estimated, 
regardless of the constraint. Some examples are r, - t } , which would be estimated by 
f, - fj = y L — yj_, and the z'th treatment mean /x, = /i + r„ which would be estimated 
by fit = fi + = y im . 

Because we are usually interested in differences among the treatment effects rather 
than their actual values, it causes no concern that the t, cannot be uniquely estimated. In 
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general, any function of the model parameters that is a linear combination of the left- 
hand side of the normal equations (Equations 3-52) can be uniquely estimated. Functions 
that are uniquely estimated regardless of which constraint is used are called estimable 
functions. For more information, see the supplemental material for this chapter. We are 
now ready to use these parameter estimates in a general development of the analysis of 
variance. 


3-9.2 The General Regression Significance Test 

A fundamental part of this procedure is writing the normal equations for the model. 
These equations may always be obtained by forming the least squares function and 
differentiating it with respect to each unknown parameter, as we did in Section 3-9.1. 
However, an easier method is available. The following rules allow the normal equations 
for any experimental design model to be written directly: 

RULE 1. There is one normal equation for each parameter in the model to be 
estimated. 

RULE 2. The right-hand side of any normal equation is just the sum of all obser¬ 
vations that contain the parameter associated with that particular normal equation. 

To illustrate this rule, consider the single-factor model. The first normal equa¬ 
tion is for the parameter /x; therefore, the right-hand side is y„ because all obser¬ 
vations contain /a. 

RULE 3. The left-hand side of any normal equation is the sum of all model param¬ 
eters, where each parameter is multiplied by the number of times it appears in the 
total on the right-hand side. The parameters are written with a circumflex O to 
indicate that they are estimators and not the true parameter values. 

For example, consider the first normal equation in a single-factor experiment. Ac¬ 
cording to the above rules, it would be 

Nfi + nf x + nf 2 + ■ ■ ■ + nf a = y„ 

because /*. appears in all N observations, Ti appears only in the n observations taken 
under the first treatment, r 2 appears only in the n observations taken under the second 
treatment, and so on. From Equation 3-52, we verify that the equation shown above is 
correct. The second normal equation would correspond to T| and is 

n(i + nfi = y K 

because only the observations in the first treatment contain T! (this gives y u as the right- 
hand side), /jl and r, appear exactly n times in y u , and all other t, appear zero times. In 
general, the left-hand side of any normal equation is the expected value of the right-hand 
side. 

Now, consider finding the reduction in the sum of squares by fitting a particular 
model to the data. By fitting a model to the data, we “explain” some of the variability; 
that is, we reduce the unexplained variability by some amount. The reduction in the 
unexplained variability is always the sum of the parameter estimates, each multiplied by 



3-9 THE REGRESSION APPROACH TO THE ANALYSIS OF VARIANCE 115 

the right-hand side of the normal equation that corresponds to that parameter. For ex¬ 
ample, in a single-factor experiment, the reduction due to fitting the full model y tj = 
(l + Tj + £,y is 


/?0, r) = jxy„ + TOT. + for. + • • • + T a y a . 

= Ax. + X fffu (3-55) 

The notation R(/x, t) means that reduction in the sum of squares from fitting the model 
containing ju. and {r, }. R(jx, t) is also sometimes called the “regression” sum of squares 
for the full model y i} = /x + t ( + The number of degrees of freedom associated with 
a reduction in the sum of squares, such as /?(/x, r), is always equal to the number of 
linearly independent normal equations. The remaining variability unaccounted for by the 
model is found from 


SS E = X X yl ~ R (^ t) (3-56) 

This quantity is used in the denominator of the test statistic for H 0 : t, = r 2 = ■ • • = 

T a = 0 . 

We now illustrate the general regression significance test for a single-factor exper¬ 
iment and show that it yields the usual one-way analysis of variance. The model is y 0 = 
H + Ti + e-ij, and the normal equations are found from the above rules as 

Nfi + nf] + nf 2 + • • • + nf a = y„ 

«A + nfi = yi. 

n(L + nf 2 = y 2 . 

«A + nf a = y a . 

Compare these normal equations with those obtained in Equation 3-52. 

Applying the constraint = 0, the estimators for /x and r, are 

A = y. ti = % ~y.. i = 1,2,..., a 


The reduction in the sum of squares due to fitting this full model is found from Equation 
3-55 as 
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which has a degrees of freedom because there are a linearly independent normal equa¬ 
tions. The error sum of squares is, from Equation 3-56, 

ss, = i S yfj - Kfji, T) 


and has N — a degrees of freedom. 

To find the sum of squares resulting from the treatment effects (the {r,}), we consider 
the model to be restricted to the null hypothesis; that is, r, = 0 for all i. The reduced 
model is y, 7 = fx + There is only one normal equation for this model: 

N(x = y_ 

and the estimator of /x is (x = y_. Thus, the reduction in the sum of squares that results 
from fitting the reduced model containing only /x is 

Rip) = iyJiyJ = jj 

Because there is only one normal equation for this reduced model, R(fx) has one degree 
of freedom. The sum of squares due to the {t,}, given that jx is already in the model, is 
the difference between R(fx, t) and R(/x), which is 

*(t|/i) = R(fi, t) - RQi) 

= /?(Full Model) - /{(Reduced Model) 


with a - 1 degrees of freedom, which we recognize from Equation 3-9 as ^Treatments- 
Making the usual normality assumption, the appropriate statistic for testing H 0 : T] = 
t 2 = ... = Ta = o is 


Z?(t| ix)/(a - 1) 


2 yfj - Rip’ T)j/(/V - a) 


which is distributed as F a ~ 1>JV _ a under the null hypothesis. This is, of course, the test 
statistic for the single-factor analysis of variance. 


3-10 NONPARAMETRIC METHODS IN THE ANALYSIS 
OF VARIANCE 

3-10.1 The Kruskal-Waiiis Test 

In situations where the normality assumption is unjustified, the experimenter may wish 
to use an alternative procedure to the F test analysis of variance that does not depend on 
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this assumption. Such a procedure has been developed by Kruskal and Wallis (1952). 
The Kruskal-Wallis test is used to test the null hypothesis that the a treatments are 
identical against the alternative hypothesis that some of the treatments generate obser¬ 
vations that are larger than others. Because the procedure is designed to be sensitive for 
testing differences in means, it is sometimes convenient to think of the Kruskal-Wallis 
test as a test for equality of treatment means. The Kruskal-Wallis test is a nonparametric 
alternative to the usual analysis of variance. 

To perform a Kruskal-Wallis test, first rank the observations y u in ascending order 
and replace each observation by its rank, say /?, y , with the smallest observation having 
rank 1. In the case of ties (observations having the same value), assign the average rank 
to each of the tied observations. Let R L be the sum of the ranks in the ith treatment. The 
test statistic is 


j_ r v m _ w n + 1)2 ~ 

S 2 n t 4 _ 


where n t is the number of observations in the ith treatment, N is the total number of 
observations, and 

<3 - 58) 

Note that S 2 is just the variance of the ranks. If there are no ties, S 2 = N(N + 1)/12, and 
the test statistic simplifies to 


N(N + 1) n, 


t - - 3(N -I 


(3-59) 


When the number of ties is moderate, there will be little difference between Equations 
3-57 and 3-59, and the simpler form (Equation 3-59) may be used. If the n, are reasonably 
large, say n, 3= 5, H is distributed approximately as xl-\ under the null hypothesis. 
Therefore, if 


H > Xl,a -1 

the null hypothesis is rejected. The P-value approach could also be used. 


EXAMPLE 3-12 . 

The data from Example 3-1 and their corresponding ranks are shown in Table 3-14 on 
the next page. Because there is a fairly large number of ties, we use Equation 3-57 as 
the test statistic. From Equation 3-58 we find 




= 53.03 
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Table 3-14 Data and Ranks for the Tensile Testing Experiment in Example 3-1 


Weight Percentage of Cotton 

15 

20 


25 

30 

35 

yij 

Ry, 

yij 

R 2j 

yy 

Ry 

yy 

R 4 j 

yy 

Ry 

7 

2.0 

12 

9.5 

14 

11.0 

19 

20.5 

7 

2.0 

7 

2.0 

17 

14.0 

18 

16.5 

25 

25.0 

10 

5.0 

15 

12.5 

12 

9.5 

18 

16.5 

22 

23.0 

11 

7.0 

11 

7.0 

18 

16.5 

19 

20.5 

19 

20.5 

15 

12.5 

9 

4.0 

18 

16.5 

19 

20.5 

23 

24.0 

11 

7.0 

R l 

27.5 


66.0 


85.0 


113.0 


33.5 


and the test statistic is 


H 



Rt _ N(N + 1) 2 1 

«, 4 J 


= 19.25 

Because H > Xo.oia = 13.28, we would reject the null hypothesis and conclude that the 
treatments differ. (The P-value for H = 19.25 is P = 0.0002.) This is the same conclusion 
given by the usual analysis of variance F test. 


3-10.2 General Comments on the Rank Transformation 

The procedure used in the previous section of replacing the observations by their ranks 
is called the rank transformation. It is a very powerful and widely useful technique. If 
we were to apply the ordinary F test to the ranks rather than to the original data, we 
would obtain 


Hl(a - 1) 

(N - 1 - H)/(N - a) 


(3-60) 


as the test statistic [see Conover (1980), p. 337]. Note that, as the Kruskal-Wallis statistic 
H increases or decreases, F 0 also increases or decreases, so the Kruskal-Wallis test is 
equivalent to applying the usual analysis of variance to the ranks. 

The rank transformation has wide applicability in experimental design problems for 
which no nonparametric alternative to the analysis of variance exists. This includes many 
of the designs in subsequent chapters of this book. If the data are ranked and the ordinary 
F test is applied, an approximate procedure that has good statistical properties results 
[see Conover and Iman (1976, 1981)]. When we are concerned about the normality 
assumption or the effect of outliers or “wild” values, we recommend that the usual 
analysis of variance be performed on both the original data and the ranks. When both 
procedures give similar results, the analysis of variance assumptions are probably satis¬ 
fied reasonably well, and the standard analysis is satisfactory. When the two procedures 
differ, the rank transformation should be preferred because it is less likely to be distorted 
by nonnormality and unusual observations. In such cases, the experimenter may want to 
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investigate the use of transformations for nonnormality and examine the data and the 
experimental procedure to determine if outliers are present and why they have occurred. 

3-11 PROBLEMS- 

3-1. The tensile strength of portland cement is being studied. Four different mixing techniques 
can be used economically. The following data have been collected: 


Mixing Technique 


Tensile Strength (lb/in 2 ) 


1 

3129 

3000 

2865 

2890 

2 

3200 

3300 

2975 

3150 

3 

2800 

2900 

2985 

3050 

4 

2600 

2700 

2600 

2765 


(a) Test the hypothesis that mixing techniques affect the strength of the cement. Use 
a = 0.05. 

(b) Construct a graphical display as described in Section 3-5.3 to compare the mean 
tensile strengths for the four mixing techniques. What are your conclusions? 

(c) Use the Fisher LSD method with a = 0.05 to make comparisons between pairs of 
means. 

(d) Construct a normal probability plot of the residuals. What conclusion would you 
draw about the validity of the normality assumption? 

(e) Plot the residuals versus the predicted tensile strength. Comment on the plot. 

(f) Prepare a scatter plot of the results to aid the interpretation of the results of this 
experiment. 

3-2. (a) Rework part (b) of Problem 3-1 using Duncan’s multiple range test with a = 0.05. 

Does this make any difference in your conclusions? 

(b) Rework part (b) of Problem 3-1 using Tukey’s test with a = 0.05. Do you get the 
same conclusions from Tukey’s test that you did from the graphical procedure and/ 
or Duncan’s multiple range test? 

(c) Explain the difference between the Tukey and Duncan procedures. 

3-3. Reconsider the experiment in Problem 3-1. Find a 95 percent confidence interval on the 
mean tensile strength of the portland cement produced by each of the four mixing tech¬ 
niques. Also find a 95 percent confidence interval on the difference in means for techniques 
1 and 3. Does this aid you in interpreting the results of the experiment? 

3-4. An experiment was run to determine whether four specific firing temperatures affect the 
density of a certain type of brick. The experiment led to the following data: 


Temperature 



Density 



100 

21.8 

21.9 

21.7 

21.6 

21.7 

125 

21.7 

21.4 

21.5 

21.4 


150 

21.9 

21.8 

21.8 

21.6 

21.5 

175 

21.9 

21.7 

21.8 

21.4 



(a) Does the firing temperature affect the density of the bricks? Use a — 0.05. 

(b) Is it appropriate to compare the means using Duncan’s multiple range test (for ex¬ 
ample) in this experiment? 

(c) Analyze the residuals from this experiment. Are the analysis of variance assumptions 
satisfied? 

(d) Construct a graphical display of the treatment as described in Section 3-5.3. Does 
this graph adequately summarize the results of the analysis of variance in part (a)? 
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3-5. Rework part (d) of Problem 3-4 using the Fisher LSD method. What conclusions can you 
draw? Explain carefully how you modified the technique to account for unequal sample 
sizes. 

3-6. A manufacturer of television sets is interested in the effect on tube conductivity of four 
different types of coating for color picture tubes. The following conductivity data are 
obtained: 


Coating Type 


Conductivity 


1 

143 

141 

150 

146 

2 

152 

149 

137 

143 

3 

134 

136 

132 

127 

4 

129 

127 

132 

129 


(a) Is there a difference in conductivity due to coating type? Use a = 0.05. 

(b) Estimate the overall mean and the treatment effects. 

(c) Compute a 95 percent confidence interval estimate of the mean of coating type 4. 
Compute a 99 percent confidence interval estimate of the mean difference between 
coating types 1 and 4. 

(d) Test all pairs of means using the Fisher LSD method with a = 0.05. 

(e) Use the graphical method discussed in Section 3-5.3 to compare the means. Which 
coating type produces the highest conductivity? 

(f) Assuming that coating type 4 is currently in use, what are your recommendations to 
the manufacturer? We wish to minimize conductivity. 

3-7. Reconsider the experiment from Problem 3-6. Analyze the residuals and draw conclusions 
about model adequacy. 

3-8. An article in the ACI Materials Journal (Vol. 84, 1987, pp. 213-216) describes several 
experiments investigating the rodding of concrete to remove entrapped air. A 3-inch X 
6-inch cylinder was used, and the number of times this rod was used is the design variable. 
The resulting compressive strength of the concrete specimen is the response. The data are 
shown in the following table: 


Rodding Level 


Compressive Strength 


10 

1530 

1530 

1440 

15 

1610 

1650 

1500 

20 

1560 

1730 

1530 

25 

1500 

1490 

1510 


(a) Is there any difference in compressive strength due to the rodding level? Use a = 
0.05. 

(b) Find the P-value for the F statistic in part (a). 

(c) Analyze the residuals from this experiment. What conclusions can you draw about 
the underlying model assumptions? 

(d) Construct a graphical display to compare the treatment means as described in Section 
3-5.3. 

3-9. An article in Environment International (Vol. 18, No. 4, 1992) describes an experiment 
in which the amount of radon released in showers was investigated. Radon-enriched water 
was used in the experiment, and six different orifice diameters were tested in shower 
heads. The data from the experiment are shown in the following table: 
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Orifice 

Diameter 


Radon Released (%) 


0.37 

80 

83 

83 

85 

0.51 

75 

75 

79 

79 

0.71 

74 

73 

76 

77 

1.02 

67 

72 

74 

74 

1.40 

62 

62 

67 

69 

1.99 

60 

61 

64 

66 


(a) Does the size of the orifice affect the mean percentage of radon released? Use a = 
0.05. 

(b) Find the F-value for the F statistic in part (a). 

(c) Analyze the residuals from this experiment. 

(d) Find a 95 percent confidence interval on the mean percent of radon released when 
the orifice diameter is 1.40. 

(e) Construct a graphical display to compare the treatment means as described in Section 
3-5.3 What conclusions can you draw? 

3-10. The response time in milliseconds was determined for three different types of circuits that 
could be used in an automatic valve shutoff mechanism. The results are shown in the 
following table: 


Circuit Type 



Response Time 



1 

9 

12 

10 

8 

15 

2 

20 

21 

23 

17 

30 

3 

6 

5 

8 

16 

7 


(a) Test the hypothesis that the three circuit types have the same response time. Use 

a = 0.01. 

(b) Use Tukey’s test to compare pairs of treatment means. Use a = 0.01. 

(c) Use the graphical procedure in Section 3-5.3 to compare the treatment means. What 
conclusions can you draw? How do they compare with the conclusions from part 

(b)? 

(d) Construct a set of orthogonal contrasts, assuming that at the outset of the experiment 
you suspected the response time of circuit type 2 to be different from the other two. 

(e) If you were the design engineer and you wished to minimize the response time, 
which circuit type would you select? 

(f) Analyze the residuals from this experiment. Are the basic analysis of variance as¬ 
sumptions satisfied? 

3-11. The effective life of insulating fluids at an accelerated load of 35 kV is being studied. 
Test data have been obtained for four types of fluids. The results were as follows: 


Fluid Type 



Life (in h) at 35 kV Load 



1 

17.6 

18.9 

16.3 

17.4 

20.1 

21.6 

2 

16.9 

15.3 

18.6 

17.1 

19.5 

20.3 

3 

21.4 

23.6 

19.4 

18.5 

20.5 

22.3 

4 

19.3 

21.1 

16.9 

17.5 

18.3 

19.8 




122 


CHAPTER 3 EXPERIMENTS WITH A SINGLE FACTOR: THE ANALYSIS OF VARIANCE 


(a) Is there any indication that the fluids differ? Use a = 0.05. 

(b) Which fluid would you select, given that the objective is long life? 

(c) Analyze the residuals from this experiment. Are the basic analysis of variance as¬ 
sumptions satisfied? 

3-12. Four different designs for a digital computer circuit are being studied to compare the 
amount of noise present. The following data have been obtained: 


Circuit Design 



Noise Observed 



1 

19 

20 

19 

30 

8 

2 

80 

61 

73 

56 

80 

3 

47 

26 

25 

35 

50 

4 

95 

46 

83 

78 

97 


(a) Is the amount of noise present the same for all four designs? Use a = 0.05. 

(b) Analyze the residuals from this experiment. Are the analysis of variance assumptions 
satisfied? 

(c) Which circuit design would you select for use? Low noise is best. 

3-13. Four chemists are asked to determine the percentage of methyl alcohol in a certain chem¬ 
ical compound. Each chemist makes three determinations, and the results are the 
following: 


Chemist 


Percentage of Methyl Alcohol 


1 

84.99 

84.04 

84.38 

2 

85.15 

85.13 

84.88 

3 

84.72 

84.48 

85.16 

4 

84.20 

84.10 

84.55 


(a) Do chemists differ significantly? Use a = 0.05. 

(b) Analyze the residuals from this experiment. 

(c) If chemist 2 is a new employee, construct a meaningful set of orthogonal contrasts 
that might have been useful at the start of the experiment. 

3-14. Three brands of batteries are under study. It is suspected that the lives (in weeks) of the 
three brands are different. Five batteries of each brand are tested with the following results: 


Weeks of Life 

Brand 1 Brand 2 Brand 3 

100 76 108 

96 80 100 

92 75 96 

96 84 98 

92 82 100 


(a) Are the lives of these brands of batteries different? 

(b) Analyze the residuals from this experiment. 

(c) Construct a 95 percent confidence interval estimate on the mean life of battery brand 
2. Construct a 99 percent confidence interval estimate on the mean difference be¬ 
tween the lives of battery brands 2 and 3. 
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(d) Which brand would you select for use? If the manufacturer will replace without 
charge any battery that fails in less than 85 weeks, what percentage would the com¬ 
pany expect to replace? 

3-15. Four catalysts that may affect the concentration of one component in a three-component 
liquid mixture are being investigated. The following concentrations are obtained: 


Catalyst 

1 

2 

3 

4 

58.2 

56.3 

50.1 

52.9 

57.2 

54.5 

54.2 

49.9 

58.4 

57.0 

55.4 

50.0 

55.8 

55.3 


51.7 

54.9 





(a) Do the four catalysts have the same effect on the concentration? 

(b) Analyze the residuals from this experiment. 

(c) Construct a 99 percent confidence interval estimate of the mean response for cata¬ 
lyst 1. 

3-16. An experiment was performed to investigate the effectiveness of five insulating materials. 
Four samples of each material were tested at an elevated voltage level to accelerate the 
time to failure. The failure times (in minutes) are shown below: 


Material 


Failure Time (minutes) 


1 

110 

157 

194 

178 

2 

1 

2 

4 

18 

3 

880 

1256 

5276 

4355 

4 

495 

7040 

5307 

10,050 

5 

7 

5 

29 

2 


(a) Do all five materials have the same effect on mean failure time? 

(b) Plot the residuals versus the predicted response. Construct a normal probability plot 
of the residuals. What information is conveyed by these plots? 

(c) Based on your answer to part (b) conduct another analysis of the failure time data 
and draw appropriate conclusions. 

3-17. A semiconductor manufacturer has developed three different methods for reducing particle 
counts on wafers. All three methods are tested on five wafers and the after-treatment 
particle count obtained. The data are shown below: 


Method Count 

1 31 10 21 4 1 

2 62 40 24 30 35 

3 53 27 120 97 68 


(a) Do all methods have the same effect on mean particle count? 

(b) Plot the residuals versus the predicted response. Construct a normal probability plot 
of the residuals. Are there potential concerns about the validity of the assumptions? 

(c) Based on your answer to part (b) conduct another analysis of the particle count data 
and draw appropriate conclusions. 
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3-18. Consider testing the equality of the means of two normal populations, where the variances 
are unknown but are assumed to be equal. The appropriate test procedure is the pooled t 
test. Show that the pooled t test is equivalent to the single-factor analysis of variance. 
3-19. Show that the variance of the linear combination 2f =1 c,y ; . is o 2 'E“ =l n i c?. 

3-20. In a fixed effects experiment, suppose that there are n observations for each of four 
treatments. Let QQ\, Q\ be single-degree-of-freedom components for the orthogonal 
contrasts. Prove that SS Trcatments = 2i + + Qi- 

3-21. Use Bartlett’s test to determine if the assumption of equal variances is satisfied in Problem 
3-14. Use a = 0.05. Did you reach the same conclusion regarding equality of variances 
by examining residual plots? 

3-22. Use the modified Levene test to determine if the assumption of equal variances is satisfied 
in Problem 3-14. Use a = 0.05. Did you reach the same conclusion regarding the equality 
of variances by examining residual plots? 

3-23. Refer to Problem 3-10. If we wish to detect a maximum difference in mean response times 
of 10 milliseconds with a probability of at least 0.90, what sample size should be used? 
How would you obtain a preliminary estimate of a 2 ? 

3-24. Refer to Problem 3-14. 

(a) If we wish to detect a maximum difference in battery life of 10 hours with a prob¬ 
ability of at least 0.90, what sample size should be used? Discuss how you would 
obtain a preliminary estimate of a 2 for answering this question. 

(b) If the difference between brands is great enough so that the standard deviation of an 
observation is increased by 25 percent, what sample size should be used if we wish 
to detect this with a probability of at least 0.90? 

3-25. Consider the experiment in Problem 3-14. If we wish to construct a 95 percent confidence 
interval on the difference in two mean battery lives that has an accuracy of ±2 weeks, 
how many batteries of each brand must be tested? 

3-26. Suppose that four normal populations have means of fi, = 50, /x 2 = 60, /x 3 = 50, and 
/jl 4 = 60. How many observations should be taken from each population so that the 
probability of rejecting the null hypothesis of equal population means is at least 0.90? 
Assume that a = 0.05 and that a reasonable estimate of the error variance is a 2 = 25. 
3-27. Refer to Problem 3-26. 

(a) How would your answer change if a reasonable estimate of the experimental error 
variance were a 2 = 36? 

(b) How would your answer change if a reasonable estimate of the experimental error 
variance were a 2 = 49? 

(c) Can you draw any conclusions about the sensitivity of your answer in this particular 
situation about how your estimate of tr affects the decision about sample size? 

(d) Can you make any recommendations about how we should use this general approach 
to choosing n in practice? 

3-28. Refer to the aluminum smelting experiment described in Section 3-8. Verify that ratio 
control methods do not affect average cell voltage. Construct a normal probability plot of 
the residuals. Plot the residuals versus the predicted values. Is there an indication that any 
underlying assumptions are violated? 

3-29. Refer to the aluminum smelting experiment in Section 3-8. Verify the analysis of variance 
for pot noise summarized in Table 3-13. Examine the usual residual plots and comment 
on the experimental validity. 

3-30. Four different feed rates were investigated in an experiment on a CNC machine producing 
a component part used in an aircraft auxiliary power unit. The manufacturing engineer in 
charge of the experiment knows that a critical part dimension of interest may be affected 
by the feed rate. However, prior experience has indicated that only dispersion effects are 
likely to be present. That is, changing the feed rate does not affect the average dimension, 
but it could affect dimensional variability. The engineer makes five production runs at 
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each feed rate and obtains the standard deviation of the critical dimension (in 10 3 mm). 
The data are shown below. Assume that all runs were made in random order. 


Feed Rate 
(in/min) 


Production Run 



1 

2 

3 

4 

5 

10 

0.09 

0.10 

0.13 

0.08 

0.07 

12 

0.06 

0.09 

0.12 

0.07 

0.12 

14 

0.11 

0.08 

0.08 

0.05 

0.06 

16 

0.19 

0.13 

0.15 

0.20 

0.11 


(a) Does feed rate have any effect on the standard deviation of this critical dimension? 

(b) Use the residuals from this experiment to investigate model adequacy. Are there any 
problems with experimental validity? 

3-31. Consider the data shown in Problem 3-10. 

(a) Write out the least squares normal equations for this problem, and solve them for 
p and t i, using the usual constraint (2 3 =1 f, = 0). Estimate - t 2 . 

(b) Solve the equations in (a) using the constraint f 3 = 0. Are the estimators f, and p 
the same as you found in (a)? Why? Now estimate t, - t 2 and compare your answer 
with that for (a). What statement can you make about estimating contrasts in the r,? 

(c) Estimate fi + t u 2t x - t 2 - r 3 , and n + t, + r 2 using the two solutions to the 
normal equations. Compare the results obtained in each case. 

3-32. Apply the general regression significance test to the experiment in Example 3-1. Show 
that the procedure yields the same results as the usual analysis of variance. 

3-33. Use the Kruskal-Wallis test for the experiment in Problem 3-11. Compare the conclusions 
obtained with those from the usual analysis of variance. 

3-34. Use the Kruskal-Wallis test for the experiment in Problem 3-12. Are the results compa¬ 
rable to those found by the usual analysis of variance? 

3-35. Consider the experiment in Example 3-1. Suppose that the largest observation on tensile 
strength is incorrectly recorded as 50. What effect does this have on the usual analysis of 
variance? What effect does it have on the Kruskal-Wallis test? 



Randomized Blocks, 
Latin Squares, and 
Related Designs 


4-1 THE RANDOMIZED COMPLETE BLOCK DESIGN 

In any experiment, variability arising from a nuisance factor can affect the results. Gen¬ 
erally, we define a nuisance factor as a design factor that probably has an effect on the 
response, but we are not interested in that effect. Sometimes a nuisance factor is un¬ 
known and uncontrolled; that is, we don’t know that the factor exists and it may even 
be changing levels while we are conducting the experiment. Randomization is the design 
technique used to guard against such a “lurking” nuisance factor. In other cases, the 
nuisance factor is known but uncontrollable. If we can at least observe the value that 
the nuisance factor takes on at each run of the experiment, we can compensate for it in 
the statistical analysis by using the analysis of covariance, a technique we will discuss 
in Chapter 14. When the nuisance source of variability is known and controllable, a 
design technique called blocking can be used to systematically eliminate its effect on 
the statistical comparisons among treatments. Blocking is an extremely important design 
technique, used extensively in industrial experimentation, and is the subject of this 
chapter. 

To illustrate the general idea, suppose we wish to determine whether or not four 
different tips produce different readings on a hardness testing machine. An experiment 
such as this might be part of a gauge capability study. The machine operates by pressing 
the tip into a metal test coupon, and from the depth of the resulting depression, the 
hardness of the coupon can be determined. The experimenter has decided to obtain four 
observations for each tip. There is only one factor—tip type—and a completely random¬ 
ized single-factor design would consist of randomly assigning each one of the 4 X 4 = 
16 runs to an experimental unit, that is, a metal coupon, and observing the hardness 
reading that results. Thus, 16 different metal test coupons would be required in this 
experiment, one for each run in the design. 

There is a potentially serious problem with a completely randomized experiment in 
this design situation. If the metal coupons differ slightly in their hardness, as might 
126 
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Table 4-1 Randomized Complete Block Design 
for the Hardness Testing Experiment 


Type of Tip 


Test Coupon 


1 

2 

3 

4 

1 

9.3 

9.4 

9.6 

10.0 

2 

9.4 

9.3 

9.8 

9.9 

3 

9.2 

9.4 

9.5 

9.7 

4 

9.7 

9.6 

10.0 

10.2 


happen if they are taken from ingots that are produced in different heats, the experimental 
units (the coupons) will contribute to the variability observed in the hardness data. As a 
result, the experimental error will reflect both random error and variability between 
coupons. 

We would like to make the experimental error as small as possible; that is, we would 
like to remove the variability between coupons from the experimental error. A design 
that would accomplish this requires the experimenter to test each tip once on each of 
four coupons. This design, shown in Table 4-1, is called a randomized complete block 
design (RCBD). The observed response is the Rockwell C scale hardness minus 40. 
The word “complete” indicates that each block (coupon) contains all the treatments 
(tips). By using this design, the blocks, or coupons, form a more homogeneous experi¬ 
mental unit on which to compare the tips. Effectively, this design strategy improves the 
accuracy of the comparisons among tips by eliminating the variability among the cou¬ 
pons. Within a block, the order in which the four tips are tested is randomly determined. 
Notice the similarity of this design problem to the one of Section 2-5 in which the paired 
f-test was discussed. The randomized complete block design is a generalization of that 
concept. 

The RCBD is one of the most widely used experimental designs. Situations for which 
the RCBD is appropriate are numerous. Units of test equipment or machinery are often 
different in their operating characteristics and would be a typical blocking factor. Batches 
of raw material, people, and time are also common nuisance sources of variability in an 
experiment that can be systematically controlled through blocking. 

Blocking may also be useful in situations that do not necessarily involve nuisance 
factors. For example, suppose that a chemical engineer is interested in the effect of 
catalyst feed rate on the viscosity of a polymer. She knows that there are several factors, 
such as raw material source, temperature, operator, and raw material purity that are very 
difficult to control in the full-scale process. Therefore she decides to test the catalyst feed 
rate factor in blocks, where each block consists of some combination of these uncon¬ 
trollable factors. In effect, she is using the blocks to test the robustness of her process 
variable (feed rate) to conditions she cannot easily control. For more discussion of this, 
see Coleman and Montgomery (1993). 

4*1.1 Statistical Analysis of the RCBD 

Suppose we have, in general, a treatments that are to be compared and b blocks. The 
randomized complete block design is shown in Figure 4-1 on the next page. There is one 
observation per treatment in each block, and the order in which the treatments are run 
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Block 1 Block 2 Block b 



within each block is determined randomly. Because the only randomization of treat¬ 
ments is within the blocks, we often say that the blocks represent a restriction on 
randomization. 

The statistical model for the RCBD can be written in several ways. The traditional 
model is an effects model: 

y„ - 11+ T, + + e„jj * {’ 2’;'; ’ £ «-•> 

where h is an overall mean, 77 is the effect of the ith treatment, j3, is the effect of the ;th 
block, and e,y is the usual NID (0, a 2 ) random error term. We will initially consider 
treatments and blocks to be fixed factors. Just as in the single-factor experimental design 
model in Chapter 3, the effects model for the RCBD is an overspecified model. Conse¬ 
quently, we usually think of the treatment and block effects as deviations from the overall 
mean so that 


2 t, = 0 and 2 Pj = 0 

/= 1 j= 1 

It is also possible to use a means model for the RCBD, say 

y, = «, + «„{}: \\ 

where Ho = /x + t, + /3 y . However, we will use the effects model in Equation 4-1 
throughout this chapter. 

In an experiment involving the RCBD, we are interested in testing the equality of 
the treatment means. Thus, the hypotheses of interest are 

Ho'-fii = Hl= ' = Ha 
Hi : at least one Hi ^ Hj 

Because the ith treatment mean Hi ~ (l/b)2*=i(/x + 77 + /S,) = h + T <> an equivalent 
way to write the above hypotheses is in term s of the treatment effects, say 

H 0 :Ti = t 2 = ■ ■ ■ = r a = 0 
Hi: 77 + 0 at least one i 
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Let y L be the total of all observations taken under treatment i, y mj be the total of all 
observations in block j, be the grand total of all observations, and N = ab be the total 
number of observations. Expressed mathematically. 


2 Xy 

i = l,2,.. 

•, a 

(4-2) 

2x7 

j= 1,2,.. 

.,b 

(4-3) 


and 

x. = S S y v = 2 x. = 2 y.j (4-4) 

i= 1 7=1 1 = 1 7=1 

Similarly, y,. is the average of the observations taken under treatment i, y.j is the average 
of the observations in block j, and y_ is the grand average of all observations. That is, 

X. = yjb y.j = yja yJN (4-5) 

We may express the total corrected sum of squares as 

2 2 (yv - x.) 2 = 22 t(x. - x.) 

i=ij=i i=ij : 1 _ 2 ( 4 - 6 ) 

+ (y.j ~ x.) + (y.j - y t . ~ y.j + x.)] 

By expanding the right-hand side of Equation 4-6, we obtain 

2 2 (y.j ~ y-f = b i (y, - y..) 2 + a 2 (y.j - y..) 2 

+ 2 2 (yu ~ % ~ y.j + x.) 2 + 222 (x. - y..)(y.j - x.) 

+ 2 2 2 (j,- - y.)(yu ~ x. - y, + x.) 

a b 

+ 222 (x. - y..)(ytj ~ x. - y.j + x.) 

1=1 7=1 

Simple but tedious algebra proves that the three cross-products are zero. Therefore, 

2 2 (y.j - x.) 2 = b 2 (x. - x.) 2 + « 2 (x/ _ x.) 2 

+ 2 2 (y.j - y.j - x. + x.) 2 (4-7) 

represents a partition of the total sum of squares. Expressing the sums of squares in 
Equation 4-7 symbolically, we have 

SS T = SS Treatments + SSsiocks + SS E (4-8) 

Because there are N observations, SS, has N — 1 degrees of freedom. There are 
a treatments and b blocks, so S5 Trea tment s and SS Blocks have a - l and b — 1 degrees 
of freedom, respectively. The error sum of squares is just a sum of squares between 
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cells minus the sum of squares for treatments and blocks. There are ab cells with 
ab - 1 degrees of freedom between them, so SS E has ab - 1 - (a - 1) - (b - 1) = 
(a — l)(b — 1) degrees of freedom. Furthermore, the degrees of freedom on the right- 
hand side of Equation 4-8 add to the total on the left; therefore, making the usual nor¬ 
mality assumptions on the errors, one may use Theorem 3-1 to show that S’S’-rreatments/o' 2 , 
SS Block Jcr 2 , and SS E /a 2 are independently distributed chi-square random variables. Each 
sum of squares divided by its degrees of freedom is a mean square. The expected value 
of the mean squares, if treatments and blocks are fixed, can be shown to be 

E(MS t reatments) = O ’ 2 + 

a — 1 

E(MS b locks) = a 2 + ’ 1 

b — 1 

E(MS e ) = a - 2 

Therefore, to test the equality of treatment means, we would use the test statistic 



which is distributed as F a - ly(a - 1Xb - u if the null hypothesis is true. The critical region is 
the upper tail of the F distribution, and we would reject H 0 if F 0 > F ata - Ua ^ X)(b ^ r) . 

We may also be interested in comparing block means because, if these means do not 
differ greatly, blocking may not be necessary in future experiments. From the expected 
mean squares, it seems that the hypothesis H 0 : /S, = 0 may be tested by comparing the 
statistic F 0 = MS Block JMS E to F aJ ,- Ua - 1)(h - ly However, recall that randomization has 
been applied only to treatments within blocks; that is, the blocks represent a restriction 
on randomization. What effect does this have on the statistic F 0 = MS mock JMS E l Some 
differences in treatment of this question exists. For example, Box, Hunter, and Hunter 
(1978) point out that the usual analysis of variance F test can be justified on the basis of 
randomization only, 1 without direct use of the normality assumption. They further ob¬ 
serve that the test to compare block means cannot appeal to such a justification because 
of the randomization restriction; but if the errors are NID(0, cr 2 ), the statistic F 0 = 
MS Blocks /MS E can be used to compare block means. On the other hand, Anderson and 
McLean (1974) argue that the randomization restriction prevents this statistic from being 
a meaningful test for comparing block means and that this F ratio really is a test for the 
equality of the block means plus the randomization restriction [which they call a restric¬ 
tion error; see Anderson and McLean (1974) for further details]. 

In practice, then, what do we do? Because the normality assumption is often ques¬ 
tionable, to view F 0 = MS Block JMS E as an exact F test on the equality of block means 
is not a good general practice. For that reason, we exclude this F test from the analysis 
of variance table. However, as an approximate procedure to investigate the effect of the 
blocking variable, examining the ratio of MS mocks to MS E is certainly reasonable. If this 


1 Actually, the normal-theory F distribution is an approximation to the randomization distribution generated by 
calculating F„ from every possible assignment of the responses to the treatments. 
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Table 4-2 Analysis of Variance for a Randomized Complete Block Design 


Source 
of Variation 

Sum of Squares 

Degrees 
of Freedom 

Mean Square 

F 0 

Treatments 

SS Treatments 

a - 1 

SS Trv; .. 

M ^Treatments 

Blocks 

SS„ locks 

b - 1 

SS mocks 

ms e 

Error 

SS E 

(a - 1X6 - 1) 

b - 1 

SS E 


Total 

SS T 

N - 1 

(< a - 1X6 - 1) 



ratio is large, it implies that the blocking factor has a large effect and that the noise 
reduction obtained by blocking was probably helpful in improving the precision of the 
comparison of treatment means. 

The procedure is usually summarized in an analysis of variance table, such as the 
one shown in Table 4-2. The computing would usually be done with a statistical software 
package. However, manual computing formulas for the sums of squares may be obtained 
for the elements in Equation 4-7 by expressing them in terms of treatment and block 
totals. These computing formulas are 


SS t = 


(4-9) 

SS Tle aunen 

i v 2 yt 

« = b& y ‘--N 

(4-10) 

SSbIocI 

a j= i N 

(4-11) 

and the error sum of squares is obtained by subtraction as 


SSe — SSt 

^Treatments ^Blocks 

(4-12) 


EXAMPLE 4-1 . 

Consider the hardness testing experiment described in Section 4-1. There are four tips 
and four available metal coupons. Each tip is tested once on each coupon, resulting in a 
randomized complete block design. The data obtained are repeated for convenience in 
Table 4-3. Remember that the order in which the tips were tested on a particular coupon 
was determined randomly. To simplify the calculations, we code the original data by 


Table 4-3 Randomized Complete Block Design 
for the Hardness Testing Experiment 


Type of Tip 


Coupon (Block) 


1 

2 

3 

4 

1 

9.3 

9.4 

9.6 

10.0 

2 

9.4 

9.3 

9.8 

9.9 

3 

9.2 

9.4 

9.5 

9.7 

4 

9.7 

9.6 

10.0 

10.2 
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Table 4-4 Coded Data for the Hardness Testing Experiment 


Type of Tip 

i 

Coupon (Block) 

2 3 

4 

y. 

1 

-2 

-1 

1 

5 

3 

2 

-1 

-2 

3 

4 

4 

3 

-3 

-1 

0 

2 

-2 

4 

2 

1 

5 

7 

15 

y.j 

-4 

-3 

9 

18 

20 = y_ 


subtracting 9.5 from each observation and multiplying the result by 10. This yields the 
data in Table 4-4. The sums of squares are obtained as follows: 


(20) 2 

= 154.00 - = 129.00 

‘^Treatments T 2 3^'. 77 

b i=\ N 

= 7 [(3 ) 2 + (4) 2 + (—2 ) 2 + (15) 2 ] ~^~ = 38.50 

4 16 

^Blocks = - 2 yj ~ 77 

a j= i A 

1 (20) 2 
= ^ [( 4) 2 + (—3 ) 2 + (9) 2 + (18) 2 ] - K ~~- = 82.50 

SS E = SS T — 55 Treatments — SS B iocks 
= 129.00 - 38.50 - 82.50 = 8.00 


The analysis of variance is shown in Table 4-5. Using a = 0.05, the critical value of F 
is F 0.05,3,9 = 3.86. Because 14.44 > 3.86, we conclude that the type of tip affects the 
mean hardness reading. The P-value for the test is also quite small. Also, the coupons 
(blocks) seem to differ significantly, because the mean square for blocks is large relative 
to error. 

It is interesting to observe the results we would have obtained had we not been aware 
of randomized block designs. Suppose we used four coupons, randomly assigned the tips 
to each, and (by chance) the same design resulted as in Table 4-3. The incorrect analysis 
of these data as a completely randomized single-factor design is shown in Table 4-6. 


Table 4-5 Analysis of Variance for the Hardness Testing Experiment 
Source of Sum of Degrees of Mean 

Variation Squares Freedom Square F 0 P-Value 

Treatments (type 

of tip) 38.50 3 12.83 14.44 0.0009 

Blocks (coupons) 82.50 3 27.50 

Error 8.00 9 0.89 

Total 129.00 


15 
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Table 4-6 Incorrect Analysis of the Hardness Testing Experiment 
as a Completely Randomized Design 


Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 


Fo 

Type of tip 

38.50 

3 

12.83 

1.70 

Error 

90.50 

12 

7.54 


Total 

129.00 

15 




Because F 0 . 05 , 3 ,i 2 = 3.49, the hypothesis of equal mean hardness measurements from the 
four tips cannot be rejected. Thus, the randomized block design reduces the amount of 
noise in the data sufficiently for differences among the four tips to be detected. This 
illustrates a very important point. If an experimenter fails to block when he or she should 
have, the effect may be to inflate the experimental error so much that important differ¬ 
ences among the treatment means may be undetectable. 


Sample Computer Output 

Condensed computer output for the hardness testing data in Example 4-1, obtained from 
Design-Expert, is shown in Figure 4-2 on page 134. Recall that the original analysis in 
Table 4-5 used coded data. (The raw responses were coded by subtracting 9.5 and mul¬ 
tiplying the result by 10.) The computer analysis used the raw responses. Consequently, 
the sums of squares in Figure 4-2 are equal to those in Table 4-5 divided by 100 (note 
that Design-Expert has rounded the sums of squares to two decimal places). 

The residuals are listed at the bottom of the computer output. They are calculated as 

e u = yu ~ h 

and, as we will later show, the fitted values are y, 7 = y L + y. y — y_, so 

e i} = y i} - % - y.j + y.. (4-13) 

In the next section, we will show how the residuals are used in model adequacy 
checking. 

Multiple Comparisons 

If the treatments in an RCBD are fixed, and the analysis indicates a significant difference 
in treatment means, the experimenter is usually interested in multiple comparisons to 
discover which treatment means differ. Any of the multiple comparison procedures dis¬ 
cussed in Chapter 3 (Section 3-5) may be used for this purpose. In the formulas of Section 
3-5, simply replace the number of replicates in the single-factor completely randomized 
design (n) by the number of blocks (h). Also, remember to use the number of error 
degrees of freedom for the randomized block [(a - 1 )(b — 1)] instead of those for the 
completely randomized design [a(n — 1)]. 

The Design-Expert output in Figure 4-2 illustrates the Fisher LSD procedure. Notice 
that if we use a = 0.05, we would conclude that ^ = /r> Now because y 3 . =£ y L =£ 
y 2 . (that is, the means y 2 . and y 3 . span some of the remaining means), we can immediately 
conclude that p., = /jl 2 = Furthermore, (jl 4 is different from all three other means. 
Thus we conclude that tip type 4 produces a mean hardness that is significantly higher 
than the mean hardness readings from the other three types of tips. 

We can also use the graphical procedure of Chapter 3 (Section 3-5.1) to compare tip 
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Response: Hardness in Rockwell C 

ANOVA for Selected Factorial Model 

Analysis of variance table [Partial sum of squares] 




Source 

Sum of 

Squares DF 

Mean F 

Square Value Prob 

> F 


Block 

0.82 3 

0.27 




Model 

0.38 3 

0.13 14.44 0.0009 s 

ignificant 

A 

0.38 3 

0.13 14.44 0.0009 


Residual 

0.080 9 

8.889E-003 




Cor 

Total 

1.29 15 





Std. Dev. 

0.094 

R-Squared 

0.8280 



Mean 

9.63 

Adj R-Squared 

0.7706 



C.V. 

0.98 

Pred R-Squared 

0.4563 



PRESS 

0.25 

Adeq Precision 

15.635 



Treatment Means (Adjusted, If Necessary) 

Estimated Standard 

Mean Error 




1-A1 

9.57 0.47 





2-A2 

9.60 0.47 





3-A3 

9.45 0.47 





4-A4 

9.88 0.47 





Mean 

Treatment Difference 

Standard 

DF Error 

t for H 0 
Coeff=0 

Prob > 11 


1 vs 2 

-0.025 

1 0.067 

-0.38 

0.7163 


1 vs 3 

0.13 

1 0.067 

1.87 

0.0935 


1 vs 4 

-0.30 

1 0.067 

-4.50 

0.0015 


2 vs 3 

0.15 

1 0.067 

2.25 

0.0510 


2 vs 4 

-0.27 

1 0.067 

-4.12 

0.0026 


3 vs 4 

-0.43 

1 0.067 

-6.37 

0.0001 


Diagnostics Case Statistics 

Standard Actual Predicted 

Order Value Value Residual Leverage 

Student 

Residual 

Cook's 

Distance 

Outlier 

t 

1 

9.30 9.35 

-0.050 0.438 

-0.707 

0.056 

-0.686 

2 

9.40 9.38 

0.025 0.438 

0.354 

0.014 

0.336 

3 

9.60 9.67 

-0.075 0.437 

-1.061 

0.125 

-1.069 

4 

10.00 9.90 

0.100 0.438 

1.414 

0.222 

1.512 

5 

9.40 9.38 

0.025 0.438 

0.354 

0.014 

0.336 

6 

9.30 9.40 

-0.100 0.437 

-1.414 

0.222 

-1.512 

7 

9.80 9.70 

0.100 0.437 

1.414 

0.222 

1.512 

8 

9.90 9.93 

-0.025 0.437 

-0.354 

0.014 

-0.336 

9 

9.20 9.22 

-0.025 0.438 

-0.354 

0.014 

-0.336 

10 

9.40 9.25 

0.150 0.437 

2.121 

0.500 

2.828 

11 

9.50 9.55 

-0.050 0.437 

-0.707 

0.056 

-0.686 

12 

9.70 9.78 

-0.075 0.437 

-1.061 

0.125 

-1.069 

13 

9.70 9.65 

0.050 0.438 

0.707 

0.056 

0.686 

14 

9.60 9.68 

-0.075 0.437 

-1.061 

0.125 

-1.069 

15 

10.00 9.97 

0.025 0.437 

0.354 

0.014 

0.336 

16 

10.20 10.20 

0.000 0.437 

0.000 

0.000 

0.000 

Figure 4-2 

Design-Expert output (condensed) for Example 4-1. 






4-1 THE RANDOMIZED COMPLETE BLOCK DESIGN 


135 


Tip 


TipTip Tip 

12 4 


12 3 4 


Average hardness (coded) 

Fi gure 4 -3 Tip typ e means relative to a scaled t distribution with a scale factor 
VMSJb = V089/4 = 0.47. 


type means. Figure 4-3 plots the four ti p type mean s from Example 4-1 relative to a 
scaled t distribution with a scale factor \/MS E /b = VO.89/4 = 0.47. This plot indicates 
that tips 1, 2, and 3 probably produce identical average hardness measurements but that 
tip 4 produces a much higher mean hardness. This figure confirms the results from the 
Fisher LSD test in the Design-Expert output in Figure 4-2. 


4-1.2 Model Adequacy Checking 

We have previously discussed the importance of checking the adequacy of the assumed 
model. Generally, we should be alert for potential problems with the normality assump¬ 
tion, unequal error variance by treatment or block, and block-treatment interaction. As 
in the completely randomized design, residual analysis is the major tool used in this 
diagnostic checking. The residuals for the randomized block design in Example 4-1 are 
listed at the bottom of the Design-Expert output in Figure 4-2. The coded residuals would 
be found by multiplying these residuals by 10. The observations, fitted values, and re¬ 
siduals for the coded hardness testing data in Example 4-1 are as follows: 


- 2.00 

- 1.00 

1.00 

5.00 

-1.00 

-2.00 

3.00 

4.00 

-3.00 

-1.00 

0.00 

2.00 

2.00 

1.00 

5.00 

7.00 


-1.50 

-1.25 

1.75 
4.00 

-1.25 

- 1.00 

2.00 

4.25 

-2.75 

-2.50 

0.50 

2.75 
1.50 

1.75 

4.75 
7.00 


-0.50 

0.25 

-0.75 

1.00 

0.25 

- 1.00 

1.00 

-0.25 

-0.25 

1.50 

-0.50 

-0.75 

0.50 

-0.75 

0.25 

0.00 


A normal probability plot and a dot diagram of these residuals are shown in Figure 
4-4 on page 136. There is no severe indication of nonnormality, nor is there any evidence 
pointing to possible outliers. Figure 4-5 (page 137) shows plots of the residuals by type 
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Figure 4-4 Normal probability plot of residuals for Example 4-1. 


of tip or treatment and by coupon or block. These plots could be, potentially, very 
informative. If there is more scatter in the residuals for a particular tip, that could indicate 
that this tip produces more erratic hardness readings than the others. More scatter in the 
residuals for a particular test coupon could indicate that the coupon is not of uniform 
hardness. However, in our example, Figure 4-5 gives no indication of inequality of vari¬ 
ance by treatment or by block. Figure 4-6 (on the facing page) plots the residuals versus 
the fitted values y 0 . There should be no relationship between the size of the residuals 
and the fitted values y t j. This plot reveals nothing of unusual interest. 

Sometimes the plot of residuals versus y,-, has a curvilinear shape; for example, there 
may be a tendency for negative residuals to occur with low y tJ values, positive residuals 
with intermediate y tJ values, and negative residuals with high y tj values. This type of 
pattern is suggestive of interaction between blocks and treatments. If this pattern occurs, 
a transformation should be used in an effort to eliminate or minimize the interaction. In 
Chapter 5 (Section 5-3.7), we describe a statistical test that can be used to detect the 
presence of interaction in a randomized block design. 


4-1.3 Some Other Aspects of the Randomized Complete Block Design 
Additivity of the Randomized Block Model 

The linear statistical model that we have used for the randomized block design 
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Figure 4-5 Plot of residuals by tip type (treatment) and by 
coupon (block) for Example 4-1. 


is completely additive. This says that, for example, if the first treatment causes the 
expected response to increase by five units (tj = 5) and if the first block increases the 
expected response by 2 units (/3i = 2), the expected increase in response of both treatment 
1 and block 1 together is E(y n ) = /a + t 1 + /3 1 = )u. + 5 + 2 = )u. + 7. In general, 
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treatment 1 always increases the expected response by 5 units over the sum of the overall 
mean and the block effect. 

Although this simple additive model is often useful, there are situations where it is 
inadequate. Suppose, for example, that we are comparing four formulations of a chemical 
product using six batches of raw material; the raw material batches are considered blocks. 
If an impurity in batch 2 affects formulation 2 adversely, resulting in an unusually low 
yield, but does not affect the other formulations, an interaction between formulations 
(or treatments) and batches (or blocks) has occurred. Similarly, interactions between 
treatments and blocks can occur when the response is measured on the wrong scale. 
Thus, a relationship that is multiplicative in the original units, say 

E(y 0 ) = liTiPj 

is linear or additive in a log scale since, for example, 

In E(y i} ) = In /i + In 77 + In 


or 


E(yfj) = fi* + r* + )S* 

Although this type of interaction can be eliminated by a transformation, not all interac¬ 
tions are so easily treated. For example, transformations do not eliminate the formulation- 
batch interaction discussed previously. Residual analysis and other diagnostic checking 
procedures can be helpful in detecting nonadditivity. 

If interaction is present, it can seriously affect and possibly invalidate the analysis 
of variance. In general, the presence of interaction inflates the error mean square and 
may adversely affect the comparison of treatment means. In situations where both factors, 
as well as their possible interaction, are of interest, factorial designs must be used. These 
designs are discussed extensively in Chapters 5 through 9. 

Random Treatments and Blocks 

Although we have described the test procedure considering treatments and blocks as 
fixed factors, the same analysis procedure is used if either the treatments or blocks (or 
both) are random. However, there are some changes in the interpretation of the results. 
For example, if the blocks are random, which is quite often the case, we expect the 
comparisons among the treatments to be the same throughout the population of blocks 
from which those used in the experiment were randomly selected. There are also corre¬ 
sponding changes in the expected mean squares. For example, if the blocks are indepen¬ 
dent random variables with common variance, E(MS RlockH ) = a 2 + aa 2 p , where cr 2 p is the 
variance component of the block effects. In any event, E(MS T rea tments) is always free of 
any block effect, and the test statistic for between-treatment variability is always 
F 0 = MS TrcMmcn JMS F . 

In situations where the blocks are random, if treatment-block interaction is present, 
the test on treatment means is unaffected by the interaction. The reason for this is that 
the expected mean squares for treatments and error both contain the interaction effect; 
consequently, the test for differences in treatment means may be conducted as usual by 
comparing the treatment mean square to the error mean square. This procedure does not 
yield any information on the interaction. 
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Choice of Sample Size 

Choosing the sample size, or the number of blocks to run, is an important decision 
when using an RCBD. Increasing the number of blocks increases the number of replicates 
and the number of error degrees of freedom, making the design more sensitive. Any of 
the techniques discussed in Chapter 3 (Section 3-7) for selecting the number of replicates 
to run in a completely randomized single-factor experiment may be applied directly to 
the RCBD. For the case of a fixed factor, the operating characteristic curves in Appendix 
Chart V may be used with 

b 2 t? 

<F 2 = i= \ (4-14) 

acr 

where there are a - 1 numerator degrees of freedom and (a - 1 ){b — 1) denominator 
degrees of freedom. 


EXAMPLE 4-2 . 

Consider the hardness testing problem described in Example 4-1. Suppose that we wish 
to determine the appropriate number of blocks to run if we are interested in detecting a 
true maximum difference in mean hardness readings of 0.4 with a high probability and 
a reasonable estimate of the standard deviation of the errors is a = 0.1. (These values 
are given in the original units; recall that the analysis of variance was performed using 
coded data.) From Equation 3-49, the minimum value of <1> 2 is (writing b, the number of 
blocks, for ri) 


<D 2 


bD 2 
2aa 2 


where D is the maximum difference we wish to detect. Thus, 


= 


fr(0.4) 2 

2(4)(0.1) 2 


2.0 b 


If we use b = 3 blocks, $ = VZOb = V2.0(3) = 2.45, and there are (a - 1 )(b - 1) = 
3(2) = 6 error degrees of freedom. Appendix Chart V with v l = a — 1 = 3 and a = 0.05 
indicates that the /3 risk for this desi gn is approx imately 0.10 (power = 1 — /3 = 0.90). 
If we use b = 4 blocks, = Vz6b = V2.0(4) = 2.83, with (a - 1 )(b - 1) = 3(3) = 
9 error degrees of freedom, and the corresponding /3 risk is approximately 0.03 (power = 
1 — j8 = 0.97). Either three or four blocks will result in a design having a high probability 
of detecting the difference between the mean hardness readings considered important. 
Because coupons (blocks) are inexpensive and readily available and the cost of testing 
is low, the experimenter decides to use four blocks. 


Estimating Missing Values 

When using the RCBD, sometimes an observation in one of the blocks is missing. This 
may happen because of carelessness or error or for reasons beyond our control, such as 
unavoidable damage to an experimental unit. A missing observation introduces a new 
problem into the analysis because treatments are no longer orthogonal to blocks; that 
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Table 4-7 Randomized Complete Block Design for the Hardness 
Testing Experiment with One Missing Value 


Type of Tip 


Coupon (Block) 


1 

2 3 

4 

1 

-2 

-1 1 

5 

2 

-1 

-2 x 

4 

3 

-3 

-1 0 

2 

4 

2 

1 5 

7 


is, every treatment does not occur in every block. There are two general approaches to 
the missing value problem. The first is an approximate analysis in which the missing 
observation is estimated and the usual analysis of variance is performed just as if the 
estimated observation were real data, with the error degrees of freedom reduced by 1. 
This approximate analysis is the subject of this section. The second is an exact analysis, 
which is discussed in Section 4-1.4. 

Suppose the observation y tf for treatment i in block j is missing. Denote the missing 
observation by x. As an illustration, suppose that in the hardness testing experiment of 
Example 4-1 coupon 3 was broken while tip 2 was being tested and that data point could 
not be obtained. The data might appear as in Table 4-7. 

In general, we will let y[, represent the grand total with one missing observation, 
y'i, represent the total for the treatment with one missing observation, and y [j be the total 
for the block with one missing observation. Suppose we wish to estimate the missing 
observation x so that x will have a minimum contribution to the error sum of squares. 
Because SS E = E? =1 2* =1 (y,;/ — y L — y.j + y..) 2 , this is equivalent to choosing x to 
minimize 


SS E = 2 S yl 




or 

SS E = x 2 - \ (yL + x) 2 - - (. y'.j + x? + \ (yL + xf + R (4-15) 
b a ab 


where R includes all terms not involving x. From dSSJdx = 0, we obtain 
_ ay\_ + by'j — yL 
* “ (a - l)(b - 1) 


(4-16) 


as the estimate of the missing observation. 

For the data in Table 4-7, we find that yL. = 1, y[ 3 = 6, and yL = 17. Therefore, 
from Equation 4-16, 


* = y 23 


4(1) + 4(6) - 17 
(3)(3) 


1.22 


The usual analysis of variance may now be performed using y 23 = 1.22 and reducing 
the error degrees of freedom by 1. The analysis of variance is shown in Table 4-8 on the 
facing page. Compare the results of this approximate analysis with the results obtained 
for the full data set (Table 4-5). 
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Table 4-8 Approximate Analysis of Variance for Example 4-1 with One Missing Value 


Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

F 0 

P-Value 

Type of tip 

39.98 

3 

13.33 

17.12 

0.0008 

Specimens (blocks) 

79.53 

3 

26.51 



Error 

6.22 

8 

0.78 



Total 

125.73 

14 





If several observations are missing, they may be estimated by writing the error sum 
of squares as a function of the missing values, differentiating with respect to each missing 
value, equating the results to zero, and solving the resulting equations. Alternatively, we 
may use Equation 4-16 iteratively to estimate the missing values. To illustrate the iterative 
approach, suppose that two values are missing. Arbitrarily estimate the first missing 
value, and then use this value along with the real data and Equation 4-16 to estimate the 
second. Now Equation 4-16 can be used to re-estimate the first missing value, and fol¬ 
lowing this, the second can be re-estimated. This process is continued until convergence 
is obtained. In any missing value problem, the error degrees of freedom are reduced by 
one for each missing observation. 


4-1.4 Estimating Model Parameters 

and the General Regression Significance Test 

If both treatments and blocks are fixed, we may estimate the parameters in the RCBD 
model by least squares. Recall that the linear statistical model is 

ya - IL + Ti + Pj + <k,jj = }’ 2’ ‘ ‘ ‘ \° b ( 4 ‘ 17 ) 

Applying the rules in Section 3-9.2 for finding the normal equations for an experimental 
design model, we obtain 


fx\ abfi + bf t + bt2 + • 

• + bt a + aj8i 

+ <*& + • 

■ + ap h = y_ 

T \: bfi + bt x 



+ 

Pi 

+ P 2 + ■ 

■ + p h = yi. 

t 2 : bfi 

+ bt 2 


+ 

Pi 

+ P 2 + • 

■ + p h = y 2 . 

T a : bfi 



+ bf a + 

A 

+ Pi + • 

■ + A = ya. 

Pi- afi, + 

Tl + 

T 2 + ' 

■ + 

ta + apt 


= y.i 

Pi- afi + 

?1 + 

T2 + ' 

■ + 

T a 


+ ap 2 

= y. 2 

Pb- afi + 

Tl + 

f 2 + ■ 

■ + 




+ ap h = y. h 


Notice that the second through the (a + l)st equations in Equation 4-18 sum to the 
first normal equation, as do the last b equations. Thus, there are two linear dependencies 
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in the normal equations, implying that two constraints must be imposed to solve Equation 
4-18. The usual constraints are 


2 t, = o 


% & = 0 


(4-19) 


Using these constraints, the normal equations simplify considerably. In fact, they become 
abjjb — y_ 
b(L + bf t = y,-. 


whose solution is 


a/1 + afij = y,j 


i= 1,2,..., a 
j = 1, 2,6 


(4-20) 


f; = y L — y_ i = 1, 2,... , a 

Pj = y.j -1. j = i. 2. b 

Using the solution to the normal equation in Equation 4-21, w 
or fitted values of y u as 


(4-21) 


e may find the estimated 


Sij = + %_+ Pj_ 

= y.. + (yi. ~ yJ + (y.j - y..) 

= yt. + y.j ~ y.. 

This result was used previously in Equation 4-13 for computing the residuals from a 
randomized block design. 

The general regression significance test can be used to develop the analysis of vari¬ 
ance for the randomized complete block design. Using the solution to the normal equa¬ 
tions given by Equation 4-21, the reduction in the sum of squares for fitting the full 
model is 


R(li, t, (3) = Ay.. + 2 + 2 Pjy.j 

= y..y.. + 2 O'/. - y.)yu + 2 ( y.j ~ yJy.j 

with a + b — l degrees of freedom, and the error sum of squares is 

ss E = 2 2 yfj ~ R (^ r, P) 

=s 

i= i j=i i= i b j= i a afr 

a b 

= 22 (y« - % - y.j + ^..) 2 
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with (a - 1 )(b — 1) degrees of freedom. Compare this last equation with SS E in Equation 
4-7. 

To test the hypothesis H 0 : t, = 0, the reduced model is 
yij = li + ft + e,y 

which is just a single-factor analysis of variance. By analogy with Equation 3-5, the 
reduction in the sum of squares for fitting the reduced model is 

R(ti, P) = i~ 

7 =i a 

which has b degrees of freedom. Therefore, the sum of squares due to {r, } after fitting 
p, and [Pj] is 

R(T\fl, P) = R(jl, r, p) - R(jl, p) 

= /?(Full Model) - ^(Reduced Model) 

-M 



which we recognize as the treatment sum of squares with a - 1 degrees of freedom 
(Equation 4-10). 

The block sum of squares is obtained by fitting the reduced model 

y>ij = (JL + Ti + €ij 

which is also a single-factor analysis. Again, by analogy with Equation 3-5, the reduction 
in the sum of squares for fitting this model is 

R(^ r) = 2 | 

with a degrees of freedom. The sum of squares for blocks {/S, } after fitting (jl and {r, } 
is 


R(P | AC T) = R(fJL, r, p) - R{fl, r) 

= y. — + y, — - — - y,— 

f+i b a ab b 

= i y ^- yl 

j h a ab 


with b — 1 degrees of freedom, which we have given previously as Equation 4-11. 

We have developed the sums of squares for treatments, blocks, and error in the 
randomized complete block design using the general regression significance test. Al¬ 
though we would not ordinarily use the general regression significance test to actually 
analyze data in a randomized complete block, the procedure occasionally proves useful 
in more general randomized block designs, such as those discussed in Section 4-4. 
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Exact Analysis of the Missing Value Problem 

In Section 4-1.3 an approximate procedure for dealing with missing observations in the 
RCBD was presented. This approximate analysis consists of estimating the missing value 
so that the error mean square is minimized. It can be shown that the approximate analysis 
produces a biased mean square for treatments in the sense that £(MS Treatments ) is larger 
than E{MS e ) if the null hypothesis is true. Consequently, too many significant results are 
reported. 

The missing value problem may be analyzed exactly by using the general regression 
significance test. The missing value causes the design to be unbalanced, and because 
all the treatments do not occur in all blocks, we say that the treatments and blocks are 
not orthogonal. This method of analysis is also used in more general types of randomized 
block designs; it is discussed further in Section 4-4. Problem 4-26 asks the reader to 
perform the exact analysis for a randomized complete block design with one missing 
value. 


4-2 THE LATIN SQUARE DESIGN 

In Section 4-1 we introduced the randomized complete block design as a design to reduce 
the residual error in an experiment by removing variability due to a known and control¬ 
lable nuisance variable. There are several other types of designs that utilize the blocking 
principle. For example, suppose that an experimenter is studying the effects of five dif¬ 
ferent formulations of a rocket propellant used in aircrew escape systems on the observed 
burning rate. Each formulation is mixed from a batch of raw material that is only large 
enough for five formulations to be tested. Furthermore, the formulations are prepared by 
several operators, and there may be substantial differences in the skills and experience 
of the operators. Thus, it would seem that there are two nuisance factors to be “averaged 
out” in the design: batches of raw material and operators. The appropriate design for 
this problem consists of testing each formulation exactly once in each batch of raw 
material and for each formulation to be prepared exactly once by each of five operators. 
The resulting design, shown in Table 4-9, is called a Latin square design. Notice that 
the design is a square arrangement and that the five formulations (or treatments) are 
denoted by the Latin letters A, B, C, D, and E; hence the name Latin square. We see that 
both batches of raw material (rows) and operators (columns) are orthogonal to treatments. 

The Latin square design is used to el imin ate two nuisance sources of variability; that 
is, it systematically allows blocking in two directions. Thus, the rows and columns ac¬ 
tually represent two restrictions on randomization. In general, a Latin square for p factors, 
or a p X p Latin square, is a square containing p rows and p columns. Each of the 


Table 4-9 Latin Square Design for the Rocket Propellant Problem 
Operators 


1 2 3 

A = 24 B = 20 C = 19 

S = 17 C = 24 D = 30 

C = 18 D = 38 E = 26 

D = 26 E = 31 A = 26 

E = 22 A = 30 B = 20 


D = 24 
E = 21 
A = 27 
B = 23 
C = 29 


5 

E = 24 
A = 36 
B = 21 
C = 22 
D = 31 


Batches of 
Raw Material 
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resulting p 2 cells contains one of the p letters that corresponds to the treatments, and each 
letter occurs once and only once in each row and column. Some examples of Latin squares 
are 


4x4 

ABDC 

BCAD 

CDBA 

DACB 


5X5 6x6 

ADBEC ADCEBF 
DACBE BAECFD 
CBEDA CEDFAB 
BEACD DCFBEA 
ECDAB FBADCE 
EFBADC 


The statistical model for a Latin square is 


y iJk = p + a, + tj + fa + € 


1.2 ,...,p 
1,2,..., p 
1.2. p 


(4-22) 


where y iJk is the observation in the ith row and £th column for the yth treatment, p. is the 
overall mean, a, is the zth row effect, r y is the yth treatment effect, (3 k is the kth column 
effect, and e ijk is the random error. Note that this is an effects model. The model is 
completely additive; that is, there is no interaction between rows, columns, and treat¬ 
ments. Because there is only one observation in each cell, only two of the three subscripts 
i, j, and k are needed to denote a particular observation. For example, referring to the 
rocket propellant problem in Table 4-9, if i = 2 and k = 3, we automatically find j = 4 
(formulation D), and if i = 1 and j = 3 (formulation C), we find k = 3. This is a 
consequence of each treatment appearing exactly once in each row and column. 

The analysis of variance consists of partitioning the total sum of squares of the N = 
p 2 observations into components for rows, columns, treatments, and error, for example, 


SS T = SS Rows + SScolumns + treatments + t (4-23) 


with respective degrees of freedom 

p 2 -l=p-l+p-l+p-l+(p- 2 )(p - 1) 

Under the usual assumption that e ijk is NID (0, a 2 ), each sum of squares on the right- 
hand side of Equation 4-23 is, upon division by a 2 , an independently distributed chi- 
square random variable. The appropriate statistic for testing for no differences in 
treatment means is 


F treatments 

° MS e 

which is distributed as F p - Up - 2)(/ ,_i> under the null hypothesis. We may also test for no 
row effect and no column effect by forming the ratio of M5 Rows or MS Co]unms to MS E . 
However, because the rows and columns represent restrictions on randomization, these 
tests may not be appropriate. 

The computational procedure for the analysis of variance is shown in Table 4-10 on 
page 146. From the computational formulas for the sums of squares, we see that the 
analysis is a simple extension of the RCBD, with the sum of squares resulting from rows 
obtained from the row totals. 
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Table 4-10 Analysis of Variance for the Latin Square Design 


Source of 

Sum of 

Degrees of 

Mean 

Variation 

Squares 

Freedom 

Square F 0 

Treatments 

'^Treatments ~ 2 X/. 

P - 1 

‘^'^Treatments p _ ^^Treatments 

p — 1 0 ms £ 

Rnw« 

SS Rows = 1 i yl - | 

P - 1 

SS Rows 




p 1 

Columns 

‘^Columns ^ ^> Xjfc jy 

P ~ 1 

^Columns 

P - 1 

Error 

SS £ (by subtraction) 

(P ~ 2)(p - 1) 

SS E 

(p-2)(p- 1) 

Total 

ss?- = E S E _ ^ 

P 2 - 1 



EXAMPLE 4-3 . 

Consider the rocket propellant problem previously described, where both batches of raw 
material and operators represent randomization restrictions. The design for this experi¬ 
ment, shown in Table 4-9, is a 5 X 5 Latin square. After coding by subtracting 25 from 
each observation, we have the data in Table 4-11. The sums of squares for the total, 
batches (rows), and operators (columns) are computed as follows: 


SS T = 2 2 2 yfjk - 2 = 

= 680 - = 676.00 

25 

^Batches = ~ 2 jf- ~ ^ 

= j [(—14) 2 + 9 2 + 5 2 + 3 2 + 7 2 ] - = 68.00 

•^Operators ~ 2 


= \ [( 18) 2 + 18 2 + (—4) 2 + 5 2 + 9 2 ] - ^ 


Table 4-11 

Coded Data for the Rocket Propellant Problem 





Batches of 
Raw Material 



Operators 





1 

2 

3 

4 


5 

y„. 

1 

A = -1 

B = -5 

C = -6 

Z> = 

-1 

£ = -1 

-14 

2 

fi = —8 

C = -1 

D = 5 

£ = 

2 

A = 11 

9 

3 

C = -7 

D = 13 

£ = 1 

A = 

2 

£ = -4 

5 

4 

D = 1 

£ = 6 

A = 1 

£ = 

-2 

C = -3 

3 

5 

£ = -3 

A = 5 

B = -5 

C = 

4 

D = 6 

7 

At 

-18 

18 

-4 

5 


9 

10 = y... 
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Table 4-12 Analysis of Variance for the Rocket Propellant Experiment 


Source of 

Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Square 

F 0 

P-Value 

Formulations 

330.00 

4 

82.50 

7.73 

0.0025 

Batches of raw material 

68.00 

4 

17.00 



Operators 

150.00 

4 

37.50 



Error 

128.00 

12 

10.67 



Total 

676.00 

24 





The totals for the treatments (Latin letters) are 

Latin Letter Treatment Total 


A y.,. = 18 

B y.2. = -24 

C y. 3. = -13 

D y . 4 . = 24 

E y. 5 . = 5 


The sum of squares resulting from the formulations is computed from these totals as 

£ 

N 


SS t 


formulations 




18 : 


+ (—24) 2 + (-13) 2 + 24 2 + 5 2 _ (10f 
5 25 


330.00 


The error sum of squares is found by subtraction: 

SS E = SS T — ^Batches - SSoperators _ ^Formulations 

= 676.00 - 68.00 - 150.00 - 330.00 = 128.00 

The analysis of variance is summarized in Table 4-12. We conclude that there is a 
significant difference in the mean burning rate generated by the different rocket propellant 
formulations. There is also an indication that there are differences between operators, so 
blocking on this factor was a good precaution. There is no strong evidence of a difference 
between batches of raw material, so it seems that in this particular experiment we were 
unnecessarily concerned about this source of variability. However, blocking on batches 
of raw material is usually a good idea. 


As in any design problem, the experimenter should investigate the adequacy of the 
model by inspecting and plotting the residuals. For a Latin square, the residuals are given 
by 

e ijk = y,jk - Sijk 

= yuk - %. ~ y.j. - y..k + 2y_ 

The reader should find the residuals for Example 4-4 and construct appropriate plots. 
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Table 4-13 Standard Latin Squares and Number of Latin Squares of Various Sizes” 


Size 

3X3 

4X4 

5X5 

6X6 

7X7 

pXp 

Examples of 

ABC 

ABCD 

ABCDE 

ABCDEF 

ABCDEFG 

ABC ...P 

standard squares 

B CA 

BCD A 

BAECD 

BCFADE 

BCDEFGA 

BCD ... A 


CAB 

CDAB 

DABC 

CDAEB 
DEB AC 
ECDBA 

CFBEAD 

DEABFC 

EADFCB 

FDECBA 

CDEFGAB 
DEFGABC 
EFGABCD 
FGABCDE 

GABCDEF 

CDE ...B 

PAB ... (P — 1) 

Number of 

standard squares 

1 

4 

56 

9408 

16,942,080 

— 

Total number of 
Latin squares 

12 

576 

161,280 

818,851,200 

61,479,419,904,000 

p\(p - 1)! X 
(number of 
standard squares) 


° Some of the information in this table is found in Statistical Tables for Biological, Agricultural and Medical Research , 4th edition, by R. A. Fisher and 


A Latin square in which the first row and column consists of the letters written in 
alphabetical order is called a standard Latin square, which is the design used in Ex¬ 
ample 4-4. A standard Latin square can always be obtained by writing the first row in 
alphabetical order and then writing each successive row as the row of letters just above 
shifted one place to the left. Table 4-13 summarizes several important facts about Latin 
squares and standard Latin squares. 

As with any experimental design, the observations in the Latin square should be 
taken in random order. The proper randomization procedure is to select the particular 
square employed at random. As we see in Table 4-13, there are a large number of Latin 
squares of a particular size, so it is impossible to enumerate all the squares and select 
one randomly. The usual procedure is to select a Latin square from a table of such designs, 
as in Fisher and Yates (1953), and then arrange the order of the rows, columns, and 
letters at random. This is discussed more completely in Fisher and Yates (1953). 

Occasionally, one observation in a Latin square is missing. For a pX p Latin square, 
the missing value may be estimated by 


yuk = 


piy'i.. + y'.j. + y'..k) - 2yL 

(P ~ 2)(p - 1 ) 


(4-24) 


where the primes indicate totals for the row, column, and treatment with the missing 
value, and y'_, is the grand total with the missing value. 

Latin squares can be useful in situations where the rows and columns represent 
factors the experimenter actually wishes to study and where there are no randomization 
restrictions. Thus, three factors (rows, columns, and letters), each at p levels, can be 
investigated in only p 2 runs. This design assumes that there is no interaction between the 
factors. More will be said later on the subject of interaction. 


Replication of Latin Squares 

A disadvantage of small Latin squares is that they provide a relatively small number of 
error degrees of freedom. For example, a 3 X 3 Latin square has only two error degrees 
of freedom, a 4 X 4 Latin square has only six error degrees of freedom, and so forth. 
When small Latin squares are used, it is frequently desirable to replicate them to increase 
the error degrees of freedom. 

There are several ways to replicate a Latin square. To illustrate, suppose that the 
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Table 4-14 

Analysis of Variance for 

a Replicated Latin Squai 

:e, Case 1 

Source of 
Variation 

Sum of Squares 

Degrees of Freedom 

Mean Square F 0 

Treatments 

iy/. -£=. 

np ,tl y - J ~ N 

P ~ 1 

SSlreaunents MS Tlca , 1 „cn l s 

p - 1 ms e 

Rows 

np y ‘~ ~ ~N 

P ~ 1 

SS R , ms 
p - 1 

Columns 

1 i a. - § 

np k =\ N 

P ~ 1 

SScolumns 

P - 1 

Replicates 

1 v 2 yL 

n - 1 

^Replicates 

Error 

Subtraction 

(P - 1 )\n(p + 1) - 3] 

ss E 

(P ~ !)[«(/> + 1) - 3] 

Total 

2 2 2 2 yiki - ^ 

V-1 



5X5 Latin square used in Example 4-4 is replicated n times. This could have been done 
as follows: 

1. Use the same batches and operators in each replicate. 

2. Use the same batches but different operators in each replicate (or, equivalently, 
use the same operators but different batches). 

3. Use different batches and different operators. 

The analysis of variance depends on the method of replication. 

Consider case 1, where the same levels of the row and column blocking factors are 
used in each replicate. Let y ijk , be the observation in row i, treatment j, column k, and 
replicate /. There are N = np 2 total observations. The analysis of variance is summarized 
in Table 4-14. 

Now consider case 2 and assume that new batches of raw material but the same 
operators are used in each replicate. Thus, there are now five new rows (in general, p 
new rows) within each replicate. The analysis of variance is summarized in Table 4-15. 


Table 4-15 

Analysis of Variance for a 

Replicated Latin Squar 

e, Case 2 

Source of 
Variation 

Sum of Squares 

Degrees of 
Freedom 

Mean 

Square F 0 

Treatments 

- i a - § 

np j=i N 

P - 1 

S5 Treatments AfS Treatments 

p - 1 ms e 

Rows 

-i t A-ii 

p i=i t=i i=i p 

n(p - 1) 

SS Ro „ s 
n(p - 1) 

Columns 

np k= \ N 

P ~ 1 

SScolumns 

p - 1 

Replicates 

1 V .2 y2 ~ 

p 2 y "' N 

n - 1 

^Replicates 

Error 

Subtraction 

(P ~ 1 )(»p - 1) 

SS E 

(P - 1 )(np ~ 1) 

Total 

2 2 E E ylk, - ^ 

np 2 ~ 1 
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Table 4-16 Analysis of Variance for a Replicated Latin Square, Case 3_ 


Source of 
Variation 

Sum of 
Squares 



Degrees of 

Freedom 

Mean Square F 0 

Treatments 

npj-i 

yL 

N 


P - 1 

SS T ,ea.me n t S MS Treatments 

p - 1 ms e 

Rows 


2 

yL 

p 2 

n{p - 1) 

SS Rows 
n(p - 1) 

Columns 


2 

yL 

p 2 

n(p ~ 1) 

SiS Columns 

n(p - 1) 


1 A 

yL 



SS Replicates 

Replicates 


N 


n - 1 

n - 1 

Error 

Subtraction 

(j> - 

- 1 )[n(p - 1) - 1] — 

SS E 

- 1 )[n(p - 1) - 1] 

Total 

2X22 ” 

yL 

N 

np 2 - 1 



Note that the source of variation for the rows really measures the variation between rows 
within the n replicates. 

Finally, consider case 3, where new batches of raw material and new operators are 
used in each replicate. Now the variation that results from both the rows and columns 
measures the variation resulting from these factors within the replicates. The analysis of 
variance is summarized in Table 4-16. 

There are other approaches to analyzing replicated Latin squares that allow some 
interactions between treatments and squares (refer to Problem 4-19). 

Crossover Designs and Designs Balanced for Residual Effects 

Occasionally, one encounters a problem in which time periods are a factor in the exper¬ 
iment. In general, there are p treatments to be tested in p time periods using np experi¬ 
mental units. For example, a human performance analyst is studying the effect of two 
replacement fluids on dehydration in 20 subjects. In the first period, half of the subjects 
(chosen at random) are given fluid A and the other half fluid B. At the end of the period, 
the response is measured, and a period of time is allowed to pass in which any physio¬ 
logical effect of the fluids is eliminated. Then the experimenter has the subjects who took 
fluid A take fluid B and those who took fluid B take fluid A. This design is called a 
crossover design. It is analyzed as a set of 10 Latin squares with two rows (time periods) 
and two treatments (fluid types). The two columns in each of the 10 squares correspond 
to subjects. 

The layout of this design is shown in Figure 4-7. Notice that the rows in the 
Latin square represent the time periods and the columns represent the subjects. The 10 


Latin Squares 



I 

II 

m 

IV 

V 

VI 

VII 

VIII 

IX 

X 

Subject 

1 2 

3 4 

5 6 

7 8 

9 10 

11 12 

13 14 

15 16 

17 18 

19 20 

Period 1 

A B 

B A 

B A 

A B 

A B 

B A 

A B 

A B 

A B 

A B 

Period 2 

B A 

A B 

A B 

B A 

B A 

A B 

B A 

B A 

B A 

B A 


Figure 4-7 
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Table 4-17 Analysis of Variance 

for the Crossover Design 
in Figure 4-7 


Source of 
Validation 

Degrees of 
Freedom 

Subjects (columns) 

19 

Periods (rows) 

1 

Fluids (letters) 

1 

Error 

18 

Total 

39 


subjects who received fluid A first (1, 4, 6, 7, 9, 12, 13, 15, 17, and 19) are randomly 
determined. 

An abbreviated analysis of variance is summarized in Table 4-17. The subject sum 
of squares is computed as the corrected sum of squares among the 20 subject totals, the 
period sum of squares is the corrected sum of squares among the rows, and the fluid sum 
of squares is computed as the corrected sum of squares among the letter totals. For further 
details of the statistical analysis of these designs, see Cochran and Cox (1957), John 
(1971), and Anderson and McLean (1974). 

It is also possible to employ Latin square type designs for experiments in which the 
treatments have a residual effect—that is, for example, if the data for fluid B in period 
2 still reflected some effect of fluid A taken in period 1. Designs balanced for residual 
effects are discussed in detail by Cochran and Cox (1957) and John (1971). 


4-3 THE GRAECO-LATIN SQUARE DESIGN 

Consider a. p X p Latin square, and superimpose on it a second p X p Latin square in 
which the treatments are denoted by Greek letters. If the two squares when superimposed 
have the property that each Greek letter appears once and only once with each Latin 
letter, the two Latin squares are said to be orthogonal, and the design obtained is called 
a Graeco-Latin square. An example of a 4 X 4 Graeco-Latin square is shown in Table 
4-18. 

The Graeco-Latin square design can be used to control systematically three sources 
of extraneous variability, that is, to block in three directions. The design allows inves¬ 
tigation of four factors (rows, columns, Latin letters, and Greek letters), each at p levels 
in only p 2 runs. Graeco-Latin squares exist for all p s* 3 except p = 6. 


Table 4-18 4X4 Graeco-Latin Square Design 

Column 

Row 1 

2 

3 

4 

1 Aa 

BP 

Cy 

DS 

2 BS 

Ay 

D\3 

Ca 

3 C/3 

Da 

AS 

By 

4 Dy 

CS 

Ba 

Al3 
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Table 4-19 Analysis of Variance for a Graeco-Latin Square Design 


Source of Variation 

Sum of Squares 

Degrees of Freedom 

Latin letter treatments 


P ~ 1 

Greek letter treatments 


P ~ 1 

Rows 


P ~ 1 

Columns 

•^Columns ~ 

P ~ 1 

Error 

SS E (by subtraction) 

(j> ~ 3 ){p - 1) 

Total 

= 2 22 2 *.-*= 

P 2 - 1 


The statistical model for the Graeco-Latin square design is 

yijkl = /A + 0; + Tj + <O k + % + €,yJ J k _ I’ 2 ' ’ ’ p ( 4 ‘ 25 ) 

[l=l, 2 

where y ijk , is the observation in row i and column / for Latin letter j and Greek letter k, 
0, is the effect of the /th row, r y is the effect of Latin letter treatment j, 04 is the effect 
of Greek letter treatment k, % is the effect of column l, and e ijki is an NID (0, a 2 ) random 
error component. Only two of the four subscripts are necessary to completely identify 
an observation. 

The analysis of variance is very similar to that of a Latin square. Because the Greek 
letters appear exactly once in each row and column and exactly once with each Latin 
letter, the factor represented by the Greek letters is orthogonal to rows, columns, and 
Latin letter treatments. Therefore, a sum of squares due to the Greek letter factor may 
be computed from the Greek letter totals and the experimental error is further reduced 
by this amount. The computational details are illustrated in Table 4-19. The null hy¬ 
potheses of equal row, column, Latin letter, and Greek letter treatments would be tested 
by dividing the corresponding mean square by mean square error. The rejection region 
is the upper-tail point of the Fp-i^-^p-^ distribution. 


EXAMPLE 4-4 . 

Suppose that in the rocket propellant experiment of Example 4-3 an additional factor, 
test assemblies, could be of importance. Let there be five test assemblies denoted by the 
Greek letters a, /3, y, 8, and e. The resulting 5X5 Graeco-Latin square design is shown 
in Table 4-20 on the facing page. 

Notice that, because the totals for batches of raw material (rows), operators (columns) 
and formulations (Latin letters) are identical to those in Example 4-3, we have 


^Batches 68.00 SSoperators 


= 150.00 and SS f 
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Table 4-20 Graeco-Latin Square Design for the Rocket Propellant Problem 


Batches of 
Raw Material 




Operators 




yt- 

l 

2 


3 

4 


5 

1 

Aot = -1 

By = 

-5 

Ce = -6 

D/3 = 

-1 

E8 = -1 

-14 

2 

B)3 = -8 

C8 = 

-1 

Da= 5 

Ey = 

2 

Ae = 11 

9 

3 

Cy=-1 

De = 

13 

E0= 1 

A8 = 

2 

Ba = -4 

5 

4 

D8= 1 

Ea = 

6 

Ay = 1 

Bt = 

-2 

CP = -3 

3 

5 

£e = -3 

Af) = 

5 

B8= -5 

Ca = 

4 

Dy = 6 

7 

y...i 

-18 

18 


-4 

5 


9 

10 = y„ 


The totals for the test assemblies (Greek letters) are 


Greek Letter 

a 


y 

s 


Test Assembly Total 

y_i.= 10 

y.. 2 . = - 6 
y.. 3 . = - 3 
3U. = - 4 
y..s. = 13 


Thus, the sum of squares due to the test assemblies is 

= | [10 2 + (—6) 2 + (—3) 2 + (—4) 2 + 13 2 ] - ^ 


The complete analysis of variance is summarized in Table 4-21. Formulations are sig¬ 
nificantly different at 1 percent. In comparing Tables 4-21 and 4-12, we observe that 
removing the variability due to test assemblies has decreased the experimental error. 
However, in decreasing the experimental error, we have also reduced the error degrees 
of freedom from 12 (in the Latin square design of Example 4-3) to 8. Thus, our estimate 
of error has fewer degrees of freedom, and the test may be less sensitive. 


Table 4-21 Analysis of Variance for the Rocket Propellant Problem 


Source of Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean Square 

F 0 

P-Value 

Formulations 

330.00 

4 

82.50 

10.00 

0.0033 

Batches of raw material 

68.00 

4 

17.00 



Operators 

150.00 

4 

37.50 



Test assemblies 

62.00 

4 

15.50 



Error 

66.00 

8 

8.25 



Total 

676.00 

24 
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The concept of orthogonal pairs of Latin squares forming a Graeco-Latin square can 
be extended somewhat. A p X p hypersquare is a design in which three or more or¬ 
thogonal p X p Latin squares are superimposed. In general, up to p + 1 factors could 
be studied if a complete set of p — 1 orthogonal Latin squares is available. Such a design 
would utilize all (p + 1 )(p — 1) = p 2 — 1 degrees of freedom, so an independent 
estimate of the error variance is necessary. Of course, there must be no interactions 
between the factors when using hypersquares. 


4-4 BALANCED INCOMPLETE BLOCK DESIGNS 

In certain experiments using randomized block designs, we may not be able to run all 
the treatment combinations in each block. Situations like this usually occur because of 
shortages of experimental apparatus or facilities or the physical size of the block. For 
example, in the hardness testing experiment (Example 4-1), suppose that because of their 
size each coupon can be used only for testing three tips. Therefore, each tip cannot be 
tested on each coupon. For this type of problem it is possible to use randomized block 
designs in which every treatment is not present in every block. These designs are known 
as randomized incomplete block designs. 

When all treatment comparisons are equally important, the treatment combinations 
used in each block should be selected in a balanced manner, that is, so that any pair of 
treatments occur together the same number of times as any other pair. Thus, a balanced 
incomplete block design (BIRD) is an incomplete block design in which any two treat¬ 
ments appear together an equal number of times. Suppose that there are a treatments and 
that each block can hold exactly k (k < a) treatments. A balanced incomplete block 
design may be constructed by taking (?) blocks and assigning a different combination of 
treatments to each block. Frequently, however, balance can be obtained with fewer than 
(?) blocks. Tables of BIBDs are given in Fisher and Yates (1953), Davies (1956), and 
Cochran and Cox (1957). 

As an example, suppose that a chemical engineer thinks that the time of reaction for 
a chemical process is a function of the type of catalyst employed. Four catalysts are 
currently being investigated. The experimental procedure consists of selecting a batch of 
raw material, loading the pilot plant, applying each catalyst in a separate run of the pilot 
plant, and observing the reaction time. Because variations in the batches of raw material 
may affect the performance of the catalysts, the engineer decides to use batches of raw 
material as blocks. However, each batch is only large enough to permit three catalysts 


Table 4-22 Balanced Incomplete Block Design 
for Catalyst Experiment 


Teratment 

(Catalyst) 

Block (Batch of Raw Material) 

yt. 

1 

2 

3 

4 

1 

73 

74 

— 

71 

218 

2 

— 

75 

67 

72 

214 

3 

73 

75 

68 

— 

216 

4 

75 

— 

72 

75 

222 

y.j 

221 

224 

207 

218 

870 = y„ 
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to be run. Therefore, a randomized incomplete block design must be used. The balanced 
incomplete block design for this experiment, along with the observations recorded, is 
shown in Table 4-22 at the bottom of the previous page. The order in which the catalysts 
are run in each block is randomized. 


4-4.1 Statistical Analysis of the BIBD 

As usual, we assume that there are a treatments and b blocks. In addition, we assume 
that each block contains k treatments, that each treatment occurs r times in the design 
(or is replicated r times), and that there are N = ar = bk total observations. Furthermore, 
the number of times each pair of treatments appears in the same block is 

, r(k - 1 ) 

a_ T^T 

If a = b, the design is said to be symmetric. 

The parameter A must be an integer. To derive the relationship for A, consider any 
treatment, say treatment 1. Because treatment 1 appears in r blocks and there are k - 1 
other treatments in each of those blocks, there are r(k — 1) observations in a block 
containing treatment 1. These r(k — 1) observations also have to represent the remaining 
a - 1 treatments A times. Therefore, A (a - 1 ) = r(k — 1 ). 

The statistical model for the BIBD is 

yij = (i + t ; + fa + e, y ( 4 - 26 ) 

where y tj is the ith observation in the y'th block, fx, is the overall mean, r, is the effect of 
the ith treatment, j8 ; is the effect of the y'th block, and e, 7 is the NID (0, <r 2 ) random error 
component. The total variability in the data is expressed by the total corrected sum of 
squares: 

SSr = 2 2 yfj - | (4-27) 

Total variability may be partitioned into 

SS T = *5*^Treatments(adjusted) T ^Blocks 4" 

where the sum of squares for treatments is adjusted to separate the treatment and the 
block effects. This adjustment is necessary because each treatment is represented in a 
different set of r blocks. Thus, differences between unadjusted treatment totals y L , 
y 2 .,..., y a _ are also affected by differences between blocks. 

The block sum of squares is 

= ( 4 - 28 ) 

where y mj is the total in the jth block. SS Hkxks has b — 1 degrees of freedom. The adjusted 
treatment sum of squares is 
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Table 4-23 Analysis of Variance for the Balanced Incomplete Block Design 


Source of 
Variation 

Sum of Squares 

Degrees of 
Freedom 

Mean Square F 0 

Treatments 



S ^TreatmentsCadjusted) MS Treatments(adjusted) 

(adjusted) 

A a 

1 

a - 1 F ° MS e 

Blocks 


b - 1 

SS B locks 

b - 1 

Error 

SS E (by subtraction) 

N-a-b +1 

ss E 

N - a - b + 1 

Total 


N - 1 



where Q, is the adjusted total for the ith treatment, which is computed as 

Qi = yu ~ ^ 2 n tjy.j i= 1 , 2 ,..., a (4-30) 

with riij - 1 if treatment i appears in block j and n tJ = 0 otherwise. The adjusted treatment 
totals will always sum to zero. 55 Tr e a tments(adjusted) has a - 1 degrees of freedom. The error 
sum of squares is computed by subtraction as 

SS E = SS T — tSiS Treatme nts( adjusted) — -^Blocks (4-31) 

and has N — a — b + 1 degrees of freedom. 

The appropriate statistic for testing the equality of the treatment effects is 

AfS Treatments(ad j us ted) 

F ° = - ZR - 

MS e 


The analysis of variance is summarized in Table 4-23. 


EXAMPLE 4-5 . 

Consider the data in Table 4-22 for the catalyst experiment. This is a BIBD with a = 4, 
b = 4, k = 3, r = 3, A = 2, and N = 12. The analysis of this data is as follows. The 
total sum of squares is 

ss r = E E yfj 

= 63,156 - 


The block sum of squares is found from Equation 4-28 as 


SS Blocks 


= 1 V y 2 - £- 
3 pi y - J 12 

= | [(221) 2 + (207) 2 + (224) 2 + (218) 2 ] - = 55.00 
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Table 4-24 Analysis of Variance for Example 4-5 


Source of 

Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

F 0 

P-Value 

Treatments (adjusted 

22.75 

3 

7.58 

11.66 

0.0107 

for blocks) 






Blocks 

55.00 

3 

— 



Error 

3.25 

5 

0.65 



Total 

81.00 

11 





To compute the treatment sum of squares adjusted for blocks, we first determine the 
adjusted treatment totals using Equation 4-30 as 

Qi = (218) - |(221 + 224 + 218) = -9/3 

Q 2 = (214) - 5(207 + 224 + 218) = -7/3 

Q z = (216) - |(221 + 207 + 224) = -4/3 

Q 4 = (222) - f(221 + 207 + 218) = 20/3 

The adjusted sum of squares for treatments is computed from Equation 4-29 as 

*ie? 

^^Treatmemst adjusted) 

_ 3[(—9/3) 2 + (—7/3) 2 + (—4/3) 2 + (20/3) 2 ] _ 

(2)(4) 

The error sum of squares is obtained by subtraction as 

SS E = SS T — t5Sxreatments( adjusted) — ^Blocks 

= 81.00 - 22.75 - 55.00 = 3.25 

The analysis of variance is shown in Table 4-24. Because the P-value is small, we 
conclude that the catalyst employed has a significant effect on the time of reaction. 


If the factor under study is fixed, tests on individual treatment means may be of 
interest. If orthogonal contrasts are employed, the contrasts must be made on the adjusted 
treatment totals, the {Q,} rather than the {y,.}. The contrast sum of squares is 

*(i c / q ) 

SS C = 1 a - 

A a 2 C 1 

where {c,} are the contrast coefficients. Other multiple comparison methods may be 
used to compare all the pairs of adjusted treatment effects, which we will find in Section 
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4-4.2, are estimated by f, = kQJ(Xa). The standard error of an adjusted treatment effect 
is 


5 = 



(4-32) 


In the analysis that we have described the total sum of squares has been partitioned 
into an adjusted sum of squares for treatments, an unadjusted sum of squares for blocks, 
and an error sum of squares. Sometimes we would like to assess the block effects. To 
do this, we require an alternate partitioning of SS T , that is, 


SS T — SS-ireatments + ^BIocks(adjusted) + SS E 

Here SS Treatmetas is unadjusted. If the design is symmetric, that is, if a = b, a simple 
formula may be obtained for SS B iocks(adjusted)- Th e adjusted block totals are 


n ‘jy>- J= !> 2 >• • 


(4-33) 


and 


'■JBlockst adjusted) 



The BIBD in Example 4-5 is symmetric because a = b = 4. Therefore, 
Qi = (221) - |(218 + 216 + 222) = 7/3 

Qi = (224) - 1(218 + 214 + 216) = 24/3 

QZ = (207) - |(214 + 216 + 222) = -31/3 

Q' a = (218) - 5(218 + 214 + 222) = 0 


and 


3[(7/3) 2 + (24/3) 2 + (-31/3) 2 + (0) 2 ] = 66Qg 
(2)(4) 


(4-34) 


Also, 


SSi 


(218) 2 + (214) 2 + (216) 2 + (222) 2 (870) 2 

3 12 


11.67 


Table 4-25 Analysis of Variance for Example 4-5, Including Both Treatments and Blocks 


Sum of Degrees of Mean 

Source of Variation Squares_Freedom_Square_ Fo _ P- Value 


Treatments (adjusted) 22.75 

Treatments (unadjusted) 11.67 

Blocks (unadjusted) 55.00 

Blocks (adjusted) 66.08 

Error 3.25 

Total 81.00 


7.58 11.66 0.0107 

22.03 33.90 0.0010 

0.65 
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A summary of the analysis of variance for the symmetric BIBD is given in Table 
4-25 at the bottom of the previous page. Notice that the sums of squares associated with 
the mean squares in Table 4-25 do not add to the total sum of squares, that is, 

SS T A SS Treatments(ad j usted) + SS B locks(adjusted) + SS E 
This is a consequence of the nonorthogonality of treatments and blocks. 


Computer Output 

There are several computer packages that will perform the analysis for a balanced in¬ 
complete block design. The SAS General Linear Models procedure is one of these, 
and Minitab, a widely used PC statistics package, is another. The upper portion of Table 
4-26 (page 160) is the Minitab General Linear Model output for Example 4-5. Comparing 
Tables 4-26 and 4-25, we see that Minitab has computed the adjusted treatment sum of 
squares and the adjusted block sum of squares (they are called “Adj SS” in the Minitab 
output). 

The lower portion of Table 4-26 is a multiple comparison analysis, using the Tukey 
method. Confidence intervals on the differences in all pairs of means and the Tukey test 
are displayed. Notice that the Tukey method would lead us to conclude that catalyst 4 is 
different from the other three. 


4-4.2 Least Squares Estimation of the Parameters 

Consider estimating the treatment effects for the BIBD model. The least squares normal 
equations are 

v-m + r 2 * t + *2 A = y. 

r,: rjl + rfj + 2 n yPj = y,-. i = 1, 2,.... a (4-35) 

(3/: kpL + 5 ) + left = y.j j = 1 , 2 , ..., b 

Imposing 2 t, = 2A = 0, we find that (L = y... Furthermore, using the equations for 
{{}j} to eliminate the block effects from the equations for {t,}, we obtain 

rkfi - rfi - 2 2 n u n pAp = hi. ~ 2 n ijy.j (4-36) 

p*i 

Note that the right-hand side of Equation 4-36 is kQ h where Q t is the ith adjusted treat¬ 
ment total (see Equation 4-29). Now, because 2* = , = A if p & i and n% = n pj 

(because n pj — 0 or 1), we may rewrite Equation 4-36 as 


r(k - l)fi - A 2 = kQi 


i = 1, 2 ,... 


(4-37) 




Table 4-26 Minitab (General Linear Model) Analysis for Example 4-5 


General Linear Model 


Factor 

Type 

Levels Values 




Catalyst 

fixed 

4 1 

2 3 4 




Block 

fixed 

4 1 

2 3 4 




Ana lysis 

of Variance for 

Time, using 

Adjusted 

SS for 

Tests 

Source 

DF 

Seq SS 

Adj SS 

Adj MS 

F 

P 

Catalyst 

3 

11.667 

22.750 

7.583 

11.67 

0.011 

Block 

3 

66.083 

66.083 

22.028 

33.89 

0.001 

Error 

5 

3.250 

3.250 

0.650 



Total 

11 

81 .000 






Tukey 95.0% Simultaneous Confidence Intervals 
Response Variable Time 

All Pairwise Comparisons among Levels of Catalyst 


( - * - ) 

( - * - ) 

( - * - ) 

-+-+-+- 

0.0 2.5 5.0 


( - -*-) 

< - * - ) 

0.0 2.5 5.0 


+ ( - -*--) 

■ -+-+-+ -■ 

0.0 2.5 5.0 


Response Variable Time 

All Pairwise Comparisons among Levels of Catalyst 


Catalyst 

Level 

= 1 subtracted 

Di fference 

SE of 


Adjusted 

Catalyst 

of Means 

Di fference 

T-Value 

P-Va lue 

2 

0.2500 

0.6982 

0.3581 

0.9825 

3 

0.6250 

0.6982 

0.8951 

0.8085 

4 

3.6250 

0.6982 

5.1918 

0.0130 

Catalyst 

Level 

= 2 subtracted 

Di fference 

f rom: 

SE of 


Adjusted 

Catalyst 

of Means 

Difference 

T-Value 

P-Value 

3 

0.3750 

0.6982 

0.5371 

0.9462 

4 

3.3750 

0.6982 

4.8338 

0.0175 

Catalyst 

Level 

= 3 subtracted 

Di fference 

from: 

SE of 


Adjusted 

Catalyst 

of Means 

Di fference 

T-Value 

P-Value 

4 

3.000 

0.6982 

4.297 

0.0281 


Catalyst = 1 subtracted from: 


-2.327 
-1.952 
1 .048 


Center 

0.2500 

0.6250 

3.6250 


Upper 

2.827 

3.202 

6.202 


Catalyst = 2 subtracted from: 

Catalyst Lower Center Upper 

3 -2.202 0.3750 2.952 

4 0.798 3.3750 5.952 


Catalyst 

Catalyst 


3 subtracted from: 


Tukey Simultaneous Tests 
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Finally, note that the constraint S“ =1 f ; = 0 implies that Sp=if p = — f> and recall that 
r(£ — 1) = A(a - 1) to obtain 

A at, = £0, 1 te 1, 2,..., a (4-38) 

Therefore, the least squares estimators of the treatment effects in the balanced incomplete 
block model are 


f, = — i = 1, 2,.... a (4-39) 

A a 

As an illustration, consider the BIBD in Example 4.5. Because 0, = -9/3, 0 2 = 
-7/3, & = -4/3, and 0 4 = 20/3, we obtain 

„ 3(—9/3) 

T ’ (2)(4) 

„ 3(—4/3) 

T3 (2)(4) 

as we found in Section 4-4.1. 


4-4.3 Recovery of Interblock Information in the BIBD 

The analysis of the BIBD given in Section 4-4.1 is usually called the intrablock analysis 
because block differences are eliminated and all contrasts in the treatment effects can be 
expressed as comparisons between observations in the same block. This analysis is ap¬ 
propriate regardless of whether the blocks are fixed or random. Yates (1940) noted that, 
if the block effects are uncorrelated random variables with zero means and variance cr%, 
one may obtain additional information about the treatment effects r, . Yates called the 
method of obtaining this additional information the interblock analysis. 

Consider the block totals y.j as a collection of b observations. The model for these 
observations [following John (1971)] is 

y.j = kfi + 2 n ij T i + {kfij + 2 (4-40) 

where the term in parentheses may be regarded as error. The interblock estimators of /jl 
and T,- are found by minimizing the least squares function 

L = S ^.7 - “ J) n (frj 

This yields the following least squares normal equations: 

lx:N(i + r 2 T = X. 

Tj'.krfi + rf t + A ^ t p = ^ n ijy.j 1 = 1. 2 


(4-41) 


= -9/8 


_ 3(—7/3) _ 

(2X4) - 7/8 

3(20/3) 

*• = = 20/8 
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where jl and t, denote the interblock estimators. Imposing the constraint 2f = , f, = 0, 
we obtain the solutions to Equations 4-41 as 

AL = (4-42) 

2 n ijy.j ~ by- 

f■, = J=l - i = 1,2,a (4-43) 

It is possible to show that the interblock estimators {t>} and the intrablock estimators 
{ Tj ) are uncorrelated. 

The interblock estimators {t,} can differ from the intrablock estimators {f,■}. For 
example, the interblock estimators for the BIBD in Example 4-5 are computed as follows: 

663 

Ti = - 

649 
t 2 =- 

. 652 

646 
t 4 =- 

Note that the values of '2j =l n iJ y J were used previously on page 157 in computing the 
adjusted treatment totals in the intrablock analysis. 

Now suppose we wish to combine the interblock and intrablock estimators to obtain 
a single, unbiased, minimum variance estimate of each 77 . It is possible to show that both 
t, and f, are unbiased and also that 

V{f t ) = —— 5 —^ (T 2 (intrablock) 

A a 

and 

V(t,> = (<r 2 + kal) (interblock) 

a{r - A) 

We use a linear combination of the two estimators, say 

r* = a,f, + a 2 Ti (4-44) 

to estimate r,. For this estimation method, the minimum variance unbiased combined 
estimator rf should have weights aj = uj(u x + « 2 ) and a 2 = u-J(u x + n 2 ), where m, = 
1/V(fj) and u 2 = l/V(f ,). Thus, the optimal weights are inversely proportional to the 
variances of t, and f,. This implies that the best combined estimator is 


- (3X3X72.50) 

3-2 

- (3X3X72.50) = _ 3 50 

- (3X3X72.50) = _ 0 50 
-(3X3X72.50), _ 65Q 
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which can be simplified to 

kQi(a 2 + kcr 2 p) + | 

T ' (r — \)a 2 + 

Unfortunately, Equation 4-45 cannot be used to estimate the r, because the variances 
< T 2 and a% are unknown. The usual approach is to estimate a 2 and <r% from the data and 
replace these parameters in Equation 4-45 by the estimates. The estimate usually taken 
for a 2 is the error mean square from the intrablock analysis of variance, or the intrablock 
error. Thus, 

& 2 = MS e 


2 n uy.j - ^o'..jo' 2 


\a(a 2 + ka 2 p) 


1,2,..., a (4-45) 


The estimate of a} is found from the mean square for blocks adjusted for treatments. In 
general, for a balanced incomplete block design, this mean square is 


*2 Qf 


If-!? 


C b - 1) 


(4-46) 


and its expected value [which is derived in Graybill (1961)] is 


__ n 2 _ air - 1) _ 2 

E[MSBlocks(adjusted)] — O' b ~ 1 ^ 


Thus, if M5 B iocks(adjusted) > MS E , the estimate of &% is 

, [M5 Bl ocks(adjus.ed, ~ MS E ](b ~ 1) 

~ a{r - 1) 

and if M5 Bl0C ks( a djusted) ^ MS E , we set = 0. This results in the combined estimator 


(4-47) 


'kQ t (& 2 + k&%) + |^2 n uy.j ~ kr y..J &2 

(r - X)& 2 + A a(& 2 + k&p) 
y L - (1 /a)y_ 


& 2 p > 0 (4-48a) 

& l = 0 (4-48b) 


We now compute the combined estimates for the data in Example 4-5. From Table 
4-25 we obtain & 2 = MS E = 0.65 and MS Blocks(adjusted) = 22.03. (Note that in computing 
MS B iocks(adjusted) we make use of the fact that this is a symmetric design. In general, we 
must use Equation 4-46.) Because MS Blocks(adjuste d) > MS E , we use Equation 4-47 to 
estimate as 


<t 


P 


(22.03 - 0.65X3) _ 
4(3 - 1) 


Therefore, we may substitute & 2 = 0.65 and &% = 8.02 into Equation 4-48a to obtain 
the combined estimates listed below. For convenience, the intrablock and interblock 
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estimates are also given. In this example, the combined estimates are close to the intra¬ 
block estimates because the variance of the interblock estimates is relatively large. 


Parameter 

Intrablock Estimate 

Interblock Estimate 

Combined Estimate 

Ti 

-1.12 

10.50 

-1.09 

t 2 

-0.88 

-3.50 

-0.88 

73 

-0.50 

-0.50 

-0.50 

74 

2.50 

-6.50 

2.47 


4-5 PROBLEMS - 

4-1. A chemist wishes to test the effect of four chemical agents on the strength of a particular 
type of cloth. Because there might be variability from one bolt to another, the chemist 
decides to use a randomized block design, with the bolts of cloth considered as blocks. 
She selects five bolts and applies all four chemicals in random order to each bolt. The 
resulting tensile strengths follow. Analyze the data from this experiment (use a = 0.05) 
and draw appropriate conclusions. 


1 73 

2 73 

3 75 

4 73 


Bolt 

2 3 4 

68 74 71 

67 75 72 

68 78 73 

71 75 75 


_5_ 

67 
70 

68 
69 


4-2. Three different washing solutions are being compared to study their effectiveness in re¬ 
tarding bacteria growth in 5-gallon milk containers. The analysis is done in a laboratory, 
and only three trials can be run on any day. Because days could represent a potential 
source of variability, the experimenter decides to use a randomized block design. Obser¬ 
vations are taken for four days, and the data are shown here. Analyze the data from this 
experiment (use a = 0.05) and draw conclusions. 


Solution 


Days 


1 

2 

3 

4 

1 

13 

22 

18 

39 

2 

16 

24 

17 

44 

3 

5 

4 

1 

22 


4-3. Plot the mean tensile strengths observed for each chemical type in Problem 4-1 and com¬ 
pare them to an appropriately scaled t distribution. What conclusions would you draw 
from this display? 

4-4. Plot the average bacteria counts for each solution in Problem 4-2 and compare them to a 
scaled t distribution. What conclusions can you draw? 

4-5. An article in the Fire Safety Journal (“The Effect of Nozzle Design on the Stability and 
Performance of Turbulent Water Jets,” Vol. 4, August 1981) describes an experiment in 
which a shape factor was determined for several different nozzle designs at six levels of 
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jet efflux velocity. Interest focused on potential differences between nozzle designs, with 
velocity considered as a nuisance variable. The data are shown below: 


Nn77 l p 


Jet Efflux Velocity (m/s) 


Design 

11.73 

14.37 

16.59 

20.43 

23.46 

28.74 

1 

0.78 

0.80 

0.81 

0.75 

0.77 

0.78 

2 

0.85 

0.85 

0.92 

0.86 

0.81 

0.83 

3 

0.93 

0.92 

0.95 

0.89 

0.89 

0.83 

4 

1.14 

0.97 

0.98 

0.88 

0.86 

0.83 

5 

0.97 

0.86 

0.78 

0.76 

0.76 

0.75 


(a) Does nozzle design affect the shape factor? Compare the nozzles with a scatter plot 
and with an analysis of variance, using a = 0.05. 

(b) Analyze the residuals from this experiment. 

(c) Which nozzle designs are different with respect to shape factor? Draw a graph of 
the average shape factor for each nozzle type and compare this to a scaled t distri¬ 
bution. Compare the conclusions that you draw from this plot to those from Duncan’s 
multiple range test. 

4-6. Consider the ratio control algorithm experiment described in Chapter 3, Section 3.8. The 
experiment was actually conducted as a randomized block design, where six time periods 
were selected as the blocks, and all four ratio control algorithms were tested in each time 
period. The average cell voltage and the standard deviation of voltage (shown in paren¬ 
theses) for each cell are as follows: 


Ratio 



Time Period 



Control 

Algorithm 

1 

2 

3 

4 

5 

6 

1 

4.93 (0.05) 

4.86 (0.04) 

4.75 (0.05) 

4.95 (0.06) 

4.79 (0.03) 

4.88 (0.05) 

2 

4.85 (0.04) 

4.91 (0.02) 

4.79 (0.03) 

4.85 (0.05) 

4.75 (0.03) 

4.85 (0.02) 

3 

4.83 (0.09) 

4.88 (0.13) 

4.90(0.11) 

4.75 (0.15) 

4.82 (0.08) 

4.90 (0.12) 

4 

4.89 (0.03) 

4.77 (0.04) 

4.94 (0.05) 

4.86 (0.05) 

4.79 (0.03) 

4.76 (0.02) 


(a) Analyze the average cell voltage data. (Use a = 0.05.) Does the choice of ratio 
control algorithm affect the average cell voltage? 

(b) Perform an appropriate analysis on the standard deviation of voltage. (Recall that 
this is called “pot noise.”) Does the choice of ratio control algorithm affect the pot 
noise? 

(c) Conduct any residual analyses that seem appropriate. 

(d) Which ratio control algorithm would you select if your objective is to reduce both 
the average cell voltage and the pot noise? 

4-7. An aluminum master alloy manufacturer produces grain refiners in ingot form. The com¬ 
pany produces the product in four furnaces. Each furnace is known to have its own unique 
operating characteristics, so any experiment run in the foundry that involves more than 
one furnace will consider furnaces as a nuisance variable. The process engineers suspect 
that stirring rate affects the grain size of the product. Each furnace can be run at four 
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different stirring rates. A randomized block design is run for a particular refiner and the 
resulting grain size data is shown below: 


Stirring Rate (rpm) 


Furnace 


1 

2 

3 

4 

5 

8 

4 

5 

6 

10 

14 

5 

6 

9 

15 

14 

6 

9 

2 

20 

17 

9 

3 

6 


(a) Is there any evidence that stirring rate affects grain size? 

(b) Graph the residuals from this experiment on a normal probability plot. Interpret this 
plot. 

(c) Plot the residuals versus furnace and stirring rate. Does this plot convey any useful 
information? 

(d) What should the process engineers recommend concerning the choice of stirring rate 
and furnace for this particular grain refiner if small grain size is desirable? 

4-8. Analyze the data in Problem 4-2 using the general regression significance test. 

4-9. Assuming that chemical types and bolts are fixed, estimate the model parameters r, and 
fa in Problem 4-1. 

4-10. Draw an operating characteristic curve for the design in Problem 4-2. Does the test seem 
to be sensitive to small differences in the treatment effects? 

4-11. Suppose that the observation for chemical type 2 and bolt 3 is missing in Problem 4-1. 
Analyze the problem by estimating the missing value. Perform the exact analysis and 
compare the results. 

4-12. Two missing values in a randomized block. Suppose that in Problem 4-1 the observations 
for chemical type 2 and bolt 3 and chemical type 4 and bolt 4 are missing. 

(a) Analyze the design by iteratively estimating the missing values, as described in 
Section 4-1.3. 

(b) Differentiate SS E with respect to the two missing values, equate the results to zero, 
and solve for estimates of the missing values. Analyze the design using these two 
estimates of the missing values. 

(c) Derive general formulas for estimating two missing values when the observations 
are in different blocks. 

(d) Derive general formulas for estimating two missing values when the observations 
are in the same block. 

4-13. An industrial engineer is conducting an experiment on eye focus time. He is interested in 
the effect of the distance of the object from the eye on the focus time. Four different 
distances are of interest. He has five subjects available for the experiment. Because there 
may be differences among individuals, he decides to conduct the experiment in a random¬ 
ized block design. The data obtained follow. Analyze the data from this experiment (use 
a = 0.05) and draw appropriate conclusions. 


Distance (ft) 


6 

8 


Subject 

1 2 3 4 5 
10 6 6 6 6 
7 6 6 1 6 
5 3 3 2 5 


10 
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4-14. The effect of five different ingredients (A, B, C, D, E) on the reaction time of a chemical 
process is being studied. Each batch of new material is only large enough to permit five 
runs to be made. Furthermore, each run requires approximately 1§ hours, so only five runs 
can be made in one day. The experimenter decides to run the experiment as a Latin square 
so that day and batch effects may be systematically controlled. She obtains the data that 
follow. Analyze the data from this experiment (use a = 0.05) and draw conclusions. 


Batch 



Day 



1 

2 

3 

4 

5 

1 

A = 8 

B = 7 

D = 1 

C = 7 

E = 3 

2 

C = 11 

E = 2 

A = 7 

D = 3 

B = 8 

3 

B = 4 

A = 9 

C = 10 

E = 1 

D = 5 

4 

D = 6 

C = 8 

E = 6 

B = 6 

A = 10 

5 

E = 4 

D = 2 

B = 3 

A = 8 

C = 8 


4-15. An industrial engineer is investigating the effect of four assembly methods (A, B, C, D) 
on the assembly time for a color television component. Four operators are selected for the 
study. Furthermore, the engineer knows that each assembly method produces such fatigue 
that the time required for the last assembly may be greater than the time required for the 
first, regardless of the method. That is, a trend develops in the required assembly time. 
To account for this source of variability, the engineer uses the Latin square design shown 
below. Analyze the data from this experiment (a = 0.05) and draw appropriate 
conclusions. 


Order of 
Assembly 


Operator 


1 

2 

3 

4 

1 

C = 10 

D = 14 

A = 7 

B = 8 

2 

B = 7 

C = 18 

D = 11 

A = 8 

3 

A = 5 

B = 10 

C = 11 

D = 9 

4 

D = 10 

A = 10 

B = 12 

C = 14 


4-16. Suppose that in Problem 4-14 the observation from batch 3 on day 4 is missing. Estimate 
the missing value from Equation 4-24, and perform the analysis using the value. 

4-17. Consider a p X p Latin square with rows (a,), columns 03*), and treatments (t ■■) fixed. 
Obtain least squares estimates of the model parameters a„ /3*, and r y . 

4-18. Derive the missing value formula (Equation 4-24) for the Latin square design. 

4-19. Designs involving several Latin squares. [See Cochran and Cox (1957), John (1971).] 
The p X p Latin square contains only p observations for each treatment. To obtain more 
replications the experimenter may use several squares, say n. It is immaterial whether the 
squares used are the same or different. The appropriate model is 


1 , 2 ,...,/? 
1 , 2 ,...,/? 
1 , 2 ,...,/? 
1, 2,..., n 


yijkh = p + Ph 


+ T i + Pm + ( T Pb + *ijkh\ 
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where y ijkh is the observation on treatment j in row i and column k of the hth square. Note 
that a m and 0 k(h) are the row and column effects in the /ith square, p h is the effect of the 
/?th square, and ( Tp) jh is the interaction between treatments and squares. 

(a) Set up the normal equations for this model, and solve for estimates of the model 
parameters. Assume that appropriate side conditions on the parameters are 2 7 ,p ; , = 
0, 2,= 0, and 2 t /3 M) = 0 for each h, 2 /fj = 0, 2 y (fp) /V , = 0 for each h, and 
2/,(fp)y„ = 0 for each j. 

(b) Write down the analysis of variance table for this design. 

4-20. Discuss how the operating characteristics curves in the Appendix may be used with the 
Latin square design. 

4-21. Suppose that in Problem 4-14 the data taken on day 5 were incorrectly analyzed and had 
to be discarded. Develop an appropriate analysis for the remaining data. 

4-22. The yield of a chemical process was measured using five batches of raw material, five 
acid concentrations, five standing times (A, B, C, D, E), and five catalyst concentrations 
(a, 0, y, 8, e). The Graeco-Latin square that follows was used. Analyze the data from 
this experiment (use a = 0.05) and draw conclusions. 


Batch 


Acid Concentration 



1 

2 

3 

4 

5 

1 

Aa = 26 

B0= 16 

Cy = 19 

D8= 16 

Ee = 13 

2 

By = 18 

C8= 21 

De = 18 

Ea= 11 

A0 = 21 

3 

Ce = 20 

Da = 12 

El3 = 16 

Ay = 25 

B8 = 13 

4 

D0 = 15 

Ey= 15 

A8 = 22 

Be = 14 

Ca = 17 

5 

E8= 10 

Ae = 24 

Ba = 17 

Cl3 = 17 

Dy= 14 


4-23. Suppose that in Problem 4-15 the engineer suspects that the workplaces used by the four 
operators may represent an additional source of variation. A fourth factor, workplace (a, 
0, y, 8) may be introduced and another experiment conducted, yielding the Graeco-Latin 
square that follows. Analyze the data from this experiment (use a = 0.05) and draw 
conclusions. 


Order of 
Assembly 


Operator 


1 

2 

3 

4 

1 

Ci3= 11 

By = 10 

D8= 14 

Aa = 8 

2 

Ba = 8 

C8= 12 

Ay = 10 

D0 = 12 

3 

AS = 9 

Da = 11 

B0 = 7 

Cy = 15 

4 

Dy= 9 

A )3 = 8 

Ca = 18 

B8= 6 


4-24. Construct a 5 X 5 hypersquare for studying the effects of five factors. Exhibit the analysis 
of variance table for this design. 

4-25. Consider the data in Problems 4-15 and 4-23. Suppressing the Greek letters in 4-23, 
analyze the data using the method developed in Problem 4-19. 

4-26. Consider the randomized block design with one missing value in Table 4-7. Analyze this 
data by using the exact analysis of the missing value problem discussed in Section 4-1.4. 
Compare your results to the approximate analysis of these data given in Table 4-8. 

4-27. An engineer is studying the mileage performance characteristics of five types of gasoline 
additives. In the road test he wishes to use cars as blocks; however, because of a time 
constraint, he must use an incomplete block design. He runs the balanced design with the 
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five blocks that follow. Analyze the data from this experiment (use a = 0.05) and draw 
conclusions. 


Additive 1 


2 14 

3 12 

4 13 

5 11 


Car 

2 3 4 

17 14 13 

14 13 

13 12 

11 11 12 

12 10 


12 

10 

9 

8 


4-28. Construct a set of orthogonal contrasts for the data in Problem 4-27. Compute the sum of 
squares for each contrast. 

4-29. Seven different hardwood concentrations are being studied to determine their effect on 
the strength of the paper produced. However, the pilot plant can only produce three runs 
each day. As days may differ, the analyst uses the balanced incomplete block design that 
follows. Analyze the data from this experiment (use a = 0.05) and draw conclusions. 


Hardwood 
Concentration (%) 




Days 




1 

2 

3 

4 

5 

6 

7 

2 

114 




120 


117 

4 

126 

120 




119 


6 


137 

117 




134 

8 

141 


129 

149 




10 


145 


150 

143 



12 



120 


118 

123 


14 




136 


130 

127 


4-30. Analyze the data in Example 4-6 using the general regression significance test. 

4-31. Prove that /fc2f =1 <2?/(An) is the adjusted sum of squares for treatments in a BIBD. 

4-32. An experimenter wishes to compare four treatments in blocks of two runs. Find a BIBD 

for this experiment with six blocks. 

4-33. An experimenter wishes to compare eight treatments in blocks of four runs. Find a BIBD 
with 14 blocks and A = 3. 

4-34. Perform the interblock analysis for the design in Problem 4-27. 

4-35. Perform the interblock analysis for the design in Problem 4-29. 

4-36. Verify that a BIBD with the parameters a = 8, r = 8, k = 4, and b = 16 does not exist. 

4-37. Show that the variance of the intrablock estimators {-fj} is k(a - l)o- 2 /(Aa 2 ). 

4-38. Extended incomplete block designs. Occasionally the block size obeys the relationship 

a < k < 2a. An extended incomplete block design consists of a single replicate of 

each treatment in each block along with an incomplete block design with k* = k - a. 
In the balanced case, the incomplete block design will have parameters k* = k - a, 
r* = r - b, and A*. Write out the statistical analysis. (Hint: In the extended incomplete 
block design, we have A = 2r - b + A*.) 
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5-1 BASIC DEFINITIONS AND PRINCIPLES 

Many experiments involve the study of the effects of two or more factors. In general, 
factorial designs are most efficient for this type of experiment. By a factorial design, 
we mean that in each complete trial or replication of the experiment all possible com¬ 
binations of the levels of the factors are investigated. For example, if there are a levels 
of factor A and b levels of factor B, each replicate contains all ab treatment combinations. 
When factors are arranged in a factorial design, they are often said to be crossed. 

The effect of a factor is defined to be the change in response produced by a change 
in the level of the factor. This is frequently called a main effect because it refers to the 
primary factors of interest in the experiment. For example, consider the simple experi¬ 
ment in Figure 5-1. This is a two-factor factorial experiment with both design factors at 
two levels. We have called these levels “low” and “high” and denoted them “ — ” and 
“+,” respectively. The main effect of factor A in this two-level design can be thought 
of as the difference between the average response at the low level of A and the average 
response at the high level of A. Numerically, this is 


40 + 52 20 + 30 



That is, increasing factor A from the low level to the high level causes an average 
response increase of 21 units. Similarly, the main effect of B is 


30 + 52 20 + 40 



If the factors appear at more than two levels, the above procedure must be modified 
because there are other ways to define the effect of a factor. This point is discussed more 
completely later. 

In some experiments, we may find that the difference in response between the levels 
of one factor is not the same at all levels of the other factors. When this occurs, there is 
170 
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Figure 5-1 A two-factor factorial expert- Figure 5-2 A two-factor factorial experi¬ 
ment, with the response (y) shown at the ment with interaction. 


an interaction between the factors. For example, consider the two-factor factorial ex¬ 
periment shown in Figure 5-2. At the low level of factor B (or B~), the A effect is 

A = 50 - 20 = 30 

and at the high level of factor B (or £ + ), the A effect is 
A = 12 - 40 = -28 

Because the effect of A depends on the level chosen for factor B, we see that there is 
interaction between A and B. The magnitude of the interaction effect is the average 
difference in these two A effects, or AB = (-28 - 30)/2 = -29. Clearly, the interaction 
is large in this experiment. 

These ideas may be illustrated graphically. Figure 5-3 plots the response data in 
Figure 5-1 against factor A for both levels of factor B. Note that the £“ and B + lines are 
approximately parallel, indicating a lack of interaction between factors A and B. Simi¬ 
larly, Figure 5-4 plots the response data in Figure 5-2. Here we see that the B and B + 
lines are not parallel. This indicates an interaction between factors A and B. Graphs such 
as these are frequently very useful in interpreting significant interactions and in reporting 
results to nonstatistically trained personnel. However, they should not be utilized as the 
sole technique of data analysis because their interpretation is subjective and their ap¬ 
pearance is often misleading. 

There is another way to illustrate the concept of interaction. Suppose that both of 
our design factors are quantitative (such as temperature, pressure, time, etc.). Then a 



FactorA 

Figure 5-3 A factorial experiment without 
interaction. 



± 


FactorA 

Figure 5-4 A factorial experiment with 
interaction. 
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regression model representation of the two-factor factorial experiment could be writ¬ 
ten as 

y = Po + + 02*2 + P12X1X2 + € 

where y is the response, the /3’s are parameters whose values are to be determined, x t is a 
variable that represents factor A, x 2 is a variable that represents factor B, and e is a random 
error term. The variables x x and x 2 are defined on a coded scale from — 1 to +1 (the low 
and high levels of A and B), and x L x 2 represents the interaction between x, and x 2 . 

The parameter estimates in this regression model turn out to be related to the effect 
estimates. For the experiment shown in Figure 5-1 we found the main effects of A and 
B to be A = 21 and B = 11. The estimates of and /3 2 are one-half the value of 
the corresponding main effect; therefore, = 21/2 = 10.5 and j 8 2 = 11/2 = 5.5. The 
interaction effect in Figure 5-1 is AB = 1, so the value of interaction coefficient in the 
regression model is /3 12 = 1/2 = 0.5. The parameter /3 0 is estimated by the average of 
all four responses, or /3 0 = (20 + 40 + 30 + 52)/4 = 35.5. Therefore, the fitted regression 
model is 

$ = 35.5 + 10.5*! + 5.5x 2 + 0.5*i* 2 



(a) The response surface 



(6) The contour plot 

Figure 5-5 Response surface and contour plot for the model y = 35.5 + 10.5x, 
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The parameter estimates obtained in the manner for the factorial design with all factors 
at two levels (- and +) turn out to be least squares estimates (more on this later). 

The interaction coefficient (j3i 2 = 0.5) is small relative to the main effect coefficients 
jS] and /3 2 . We will take this to mean that interaction is small and can be ignored. There¬ 
fore, dropping the term 0.5x,x 2 gives us the model 

y = 35.5 + 10.5*! + 5.5x 2 

Figure 5-5 presents graphical representations of this model. In Figure 5-5 a we have a 
plot of the plane of y-values generated by the various combinations of x, and x 2 . This 
three-dimensional graph is called a response surface plot. Figure 5-5 b shows the contour 
lines of constant response y in the x lt x 2 plane. Notice that because the response surface 
is a plane, the contour plot contains parallel straight lines. 

Now suppose that the interaction contribution to this experiment was not negligible; 
that is, the coefficient /3 l2 was not small. Figure 5-6 presents the response surface and 
contour plot for the model 

$ = 35.5 + 10.5^! + 5.5x 2 + 8x,x 2 



(a) The response surface 



(6) The contour plot 

Figure 5-6 Response surface and contour plot for the model y = 35.5 + 

10.5a:, + 5.5x 2 + Sx t x 2 . 
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(We have let the interaction effect be the average of the two main effects.) Notice that 
the significant interaction effect “twists” the plane in Figure 5-6 a. This twisting of the 
response surface results in curved contour lines of constant response in the x u x 2 plane, 
as shown in Figure 5-6 b. Thus, interaction is a form of curvature in the underlying 
response surface model for the experiment. 

The response surface model for an experiment is extremely important and useful. 
We will say more about it in Section 5-5 and in subsequent chapters. 

Generally, when an interaction is large, the corresponding main effects have little 
practical meaning. For the experiment in Figure 5-2, we would estimate the main effect 
of A to be 

50 + 12 20 + 40 

A = __- —=1 

which is very small, and we are tempted to conclude that there is no effect due to A. 
However, when we examine the effects of A at different levels of factor B, we see that 
this is not the case. Factor A has an effect, but it depends on the level of factor B. That 
is, knowledge of the AB interaction is more useful than knowledge of the main effect. A 
significant interaction will often mask the significance of main effects. These points are 
clearly indicated by the interaction plot in Figure 5-4. In the presence of significant 
interaction, the experimenter must usually examine the levels of one factor, say A, with 
levels of the other factors fixed to draw conclusions about the main effect of A. 


5-2 THE ADVANTAGE OF FACTORIALS 

The advantage of factorial designs can be easily illustrated. Suppose we have two factors 
A and B, each at two levels. We denote the levels of the factors by A - , A + , B , and B + . 
Information on both factors could be obtained by varying the factors one at a time, as 
shown in Figure 5-7. The effect of changing factor A is given by A + B~ — A~B~ and the 
effect of changing factor B is given by A~B + - A~B~. Because experimental error is 
present, it is desirable to take two observations, say, at each treatment combination and 
estimate the effects of the factors using average responses. Thus, a total of six observa¬ 
tions are required. 

If a factorial experiment had been performed, an additional treatment combination, 


Factor A 


Figure 5-7 

experiment. 
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Figure 5-8 Relative efficiency of a factorial design to a one- 
factor-at-a-time experiment (two factor levels). 


A + B + , would have been taken. Now, using just four observations, two estimates of the 
A effect can be made: A + B~ - A~B~ and A + B + - A~B + . Similarly, two estimates of 
the B effect can be made. These two estimates of each main effect could be averaged 
to produce average main effects that are just as precise as those from the single¬ 
factor experiment, but only four total observations are required and we would say that 
the relative efficiency of the factorial design to the one-factor-at-a-time experiment is 
(6/4) = 1.5. Generally, this relative efficiency will increase as the number of factors 
increases, as shown in Figure 5-8. 

Now suppose interaction is present. If the one-factor-at-a-time design indicated that 
A B + and A + B~ gave better responses than A~B~, a logical conclusion would be that 
A + B + would be even better. However, if interaction is present, this conclusion may be 
seriously in error. For an example, refer to the experiment in Figure 5-2. 

In summary, note that factorial designs have several advantages. They are more 
efficient than one-factor-at-a-time experiments. Furthermore, a factorial design is nec¬ 
essary when interactions may be present to avoid misleading conclusions. Finally, fac¬ 
torial designs allow the effects of a factor to be estimated at several levels of the other 
factors, yielding conclusions that are valid over a range of experimental conditions. 


5-3 THE TWO-FACTOR FACTORIAL DESIGN 
5-3.1 An Example 

The simplest types of factorial designs involve only two factors or sets of treatments. 
There are a levels of factor A and b levels of factor B, and these are arranged in a factorial 
design; that is, each replicate of the experiment contains all ab treatment combinations. 
In general, there are n replicates. 

As an example of a factorial design involving two factors, an engineer is designing 
a battery for use in a device that will be subjected to some extreme variations in tem¬ 
perature. The only design parameter that he can select at this point is the plate material 
for the battery, and he has three possible choices. When the device is manufactured and 
is shipped to the field, the engineer has no control over the temperature extremes that 
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Table 5-1 Life (in hours) Data for the Battery Design Example 


Material 

Type 



Temperature (°F) 



15 


70 



125 

1 

130 

155 

34 

40 

20 

70 


74 

180 

80 

75 

82 

58 

2 

150 

188 

136 

122 

25 

70 


159 

126 

106 

115 

58 

45 

3 

138 

110 

174 

120 

96 

104 


168 

160 

150 

139 

82 

60 


the device will encounter, and he knows from experience that temperature will probably 
affect the effective battery life. However, temperature can be controlled in the product 
development laboratory for the purposes of a test. 

The engineer decides to test all three plate materials at three temperature levels— 
15,70, and 125°F—because these temperature levels are consistent with the product end- 
use environment. Four batteries are tested at each combination of plate material and 
temperature, and all 36 tests are run in random order. The experiment and the resulting 
observed battery life data are given in Table 5-1. 

In this problem the engineer wants to answer the following questions: 

1. What effects do material type and temperature have on the life of the battery? 

2. Is there a choice of material that would give uniformly long life regardless of 
temperature ? 

This last question is particularly important. It may be possible to find a material alter¬ 
native that is not greatly affected by temperature. If this is so, the engineer can make the 
battery robust to temperature variation in the field. This is an example of using statistical 
experimental design for robust product design, a very important engineering problem. 

This design is a specific example of the general case of a two-factor factorial. To 
pass to the general case, let y iJk be the observed response when factor A is at the ith level 
(t = 1,2 a) and factor B is at the jth level (j = 1,2 b) for the Ath replicate 
(A = 1, 2,...,«). In general, a two-factor factorial experiment will appear as in Table 
5-2. The order in which the abn observations are taken is selected at random so that this 
design is a completely randomized design. 

Table 5-2 General Arrangement for a Two-Factor Factorial Design 

Factor B 



Factor A 
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The observations in a factorial experiment can be described by a model. There are 
several ways to write the model for a factorial experiment. The effects model is 


y.jk = H + Ti + Pj + (TP\j + €y* 


1 = 1 , 2 ,.. 
7 = 1 , 2 ,.. 
k — 1, 2,.. 


a 

b 


n 


(5-1) 


where fi is the overall mean effect, t, is the effect of the ith level of the row factor A, pj 
is the effect of the jth level of column factor B, (r/3)y is the effect of the interaction 
between r, and Pj, and e iJk is a random error component. Both factors are assumed to be 
fixed, and the treatment effects are defined as deviations from the overall mean, so 
2Uti = 0 and EjLijS,- = 0. Similarly, the interaction effects are fixed and are defined 
such that 2f =1 (r/3)y = Ej’ =1 (T)3)y = 0. Because there are n replicates of the experiment, 
there are abn total observations. 

Another possible model for a factorial experiment is the means model 


= p-a + f-ijk 


i = 1,2 a 
j = 1,2,..., b 
k= 1,2,..., n 


where the mean of the i/th cell is 


B-ij = (i+ T, + pj + (Tp)ij 

We could also use a regression model as in Section 5-1. Regression models are partic¬ 
ularly useful when one or more of the factors in the experiment are quantitative. Through¬ 
out most of this chapter we will use the effects model (Equation 5-1) with reference to 
the regression model in Section 5-5. 

In the two-factor factorial, both row and column factors (or treatments), A and B, 
are of equal interest. Specifically, we are interested in testing hypotheses about the 
equality of row treatment effects, say 


H 0 : Ti = t 2 = • • ■ = T a = 0 
Hi : at least one t, A 0 


(5-2a) 


and the equality of column treatment effects, say 

Ho-.Pi = p 2 = = P„ = 0 

Hi: at least one j3, # 0 


(5-2b) 


We are also interested in determining whether row and column treatments interact. Thus, 
we also wish to test 


H 0 :(Tp),j = 0 for all ij 
Hi rat least one (t/3), 7 AO 

We now discuss how these hypotheses are tested using a two-factor analysis of 
variance. 


5-3.2 Statistical Analysis of the Fixed Effects Model 

Let y,.. denote the total of all observations under the ith level of factor A, denote the 
total of all observations under the y'th level of factor B, y tj _ denote the total of all obser- 
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vations in the ijth cell, and y... denote the grand total of all the observations. Define 
y t y.j., yij., and y„. as the corresponding row, column, cell, and grand averages. Ex¬ 
pressed mathematically, 


= 2 2 y-jk 

y,=f /=] 

bn 

1, 2,. 

.., a 

= 2 2 yuk 

v,. = ^ j- I 

1, 2,. 

..,b 

v 

yu. i = ] 

L, 2, . 

.., a 

= 2j ytjk 

= V j = i 

1, 2, . 

..,b 

= 2 2 2 yak 

y -2=, 
abn 




The total corrected sum of squares may be written as 

2 2 2 (y-jk - yJ 2 = 22 2 [(y... - yJ + (%■ ~ y J 

+ (yij. ~ yu. ~ y.j. + yJ + (y iJk - y #)] 2 
= bn 2 (Pi. - y ...) 2 + an J) (y.j. - y.J 2 (5-4) 

+ « 2 2 (yu. ~ yi .. - y.j. + y...) 2 

+ 2 2 2 (y.y* - S’y.) 2 

because the six cross-products on the right-hand side are zero. Notice that the total sum 
of squares has been partitioned into a sum of squares due to “rows,” or factor A, (SS A ); 
a sum of squares due to “columns,” or factor B, (SS B ); a sum of squares due to the 
interaction between A and B, (SS AB )\ and a sum of squares due to error, (SS E ). From the 
last component on the right-hand side of Equation 5-4, we see that there must be at least 
two replicates (n 3 s 2) to obtain an error sum of squares. 

We may write Equation 5-4 symbolically as 

SS T = SS A + SS B + SS AB + SS E (5-5) 

The number of degrees of freedom associated with each sum of squares is 


Effect Degrees of Freedom 

A a — l 

B b - 1 

(a - 1 )(b - 1) 
ab(n — 1) 


AB interaction 
Error 
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We may justify this allocation of the abn — 1 total degrees of freedom to the sums of 
squares as follows: The main effects A and B have a and b levels, respectively; therefore 
they have a - l and b — 1 degrees of freedom as shown. The interaction degrees of 
freedom are simply the number of degrees of freedom for cells (which is ab - 1) minus 
the number of degrees of freedom for the two main effects A and B; that is, ab - 1 - 
(a - 1) — (b — 1) = (a — \)(b - 1). Within each of the ab cells, there are n - 1 
degrees of freedom between the n replicates; thus there are ab(n — 1) degrees of freedom 
for error. Note that the number of degrees of freedom on the right-hand side of Equation 
5-5 adds to the total number of degrees of freedom. 

Each sum of squares divided by its degrees of freedom is a mean square. The 
expected values of the mean squares are 


E(MS, 


. . bn E t? 


E(MS b ) = 


H£0-‘ 


b - 1 

n 1 E (t/3 % 

f (a - l)(b - 1) 


E(MS e ) = E 


{ SS = a 2 
\ab(n - 1)/ 


Notice that if the null hypotheses of no row treatment effects, no column treatment 
effects, and no interaction are true, then MS A , MS B , MS AB , and MS E all estimate a 2 . 
However, if there are differences between row treatment effects, say, then MS A will be 
larger than MS E . Similarly, if there are column treatment effects or interaction present, 
then the corresponding mean squares will be larger than MS E . Therefore, to test the 
significance of both main effects and their interaction, simply divide the corresponding 
mean square by the error mean square. Large values of this ratio imply that the data do 
not support the null hypothesis. 

If we assume that the model (Equation 5-1) is adequate and that the error terms 
e ijk are normally and independently distributed with constant variance cr 2 , then each of 
the ratios of mean squares MS A /MS E , MS B /MS E , and MS AB /MS E are distributed as F with 
a - 1, b - 1, and (a - \){b - 1) numerator degrees of freedom, respectively, and 
ab(n - 1) denominator degrees of freedom, 1 and the critical region would be the upper 
tail of the F distribution. The test procedure is usually summarized in an analysis of 
variance table, as shown in Table 5-3 on the next page. 

Computationally, we usually employ a statistical software package to conduct an 


1 The F test may be viewed as an approximation to a randomization test, as noted previously. 
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Table 5-3 The Analysis of Variance Table for the Two-Factor Factorial, Fixed Effects Model 


Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean Square 

To 

A treatments 

SS A 

a - 1 

SS A 

m A = 

ms a 

0 ms e 

B treatments 

SS B 

b - 1 

ss B 

ms b = 

F 0 = ^ 
ms e 

Interaction 

SSab 

(a - l)(f> - 1) 


_ms ab 
F ° _ ms e 

Error 

SS e 

ab(n - 1) 

SS F 

ms e = 

ab(n — 1) 


Total 

SS t 

abn - 1 




analysis of variance. However, manual computing formulas for the sums of squares in 
Equation 5-5 may be obtained easily. The total sum of squares is computed as usual by 

ss r =iii%-£ <5-6) 

The sums of squares for the main effects are 


SS A 


= — j'y 2 

bn h y '~ 


abn 


(5-7) 


and 


: — i / - — 

an y -‘- abn 


It is convenient to obtain the SS AB in two stages. First we compute the sum of squares 
between the ab cell totals, which is called the sum of squares due to “subtotals”: 


= - i £ t 2 ' - — 

s n h h y ' J - abn 


This sum of squares also contains SS A and SS B . Therefore, the second step is to compute 
SS AB as 


SS AB = Subtotals - SS A - SS B (5-9) 

We may compute SS E by subtraction as 

SS E = SS T - SS AB - SS A - SS B (5-10) 


or 


SS E = SS T - S5 Subtotals 


EXAMPLE 5-1 . 

The Battery Design Experiment 

Table 5-4 presents the effective life (in hours) observed in the battery design example 
described in Section 5-3.1. The row and column totals are shown in the margins of the 
table and the circled numbers are the cell totals. 
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Table 5-4 Life Data (in hours) for the Battery Design Experiment 


Material _ 


Temperature (°F) 



y,.. 

Type 

15 


70 


125 

130 

74 

155 (Q) 
180 

34 

80 


20 

82 

70 ^30) 

58 

998 

2 150 
159 

S@ 

136 

106 

IS© 

25 

58 

70 (l98) 

45 V-' 

1300 

3 138 

168 

:s© 

174 

150 


96 

82 

104 (Q) 

60 

1501 

y.j. 

1738 


1291 


770 

3799 = y... 


The sums of squares are computed as follows: 


- (60) 2 - (37 ^ 9) = 77,646.97 
36 


ssr =^ 

= (130) 2 + (155) 2 + (74) 2 + • 

^Material = T~ 2 }£. X" 

bn abn 

1 (3799) 2 

= 7^7 [(998) 2 + (1300) 2 + (1501) 2 ] - K -— L = 10,683.72 
(3)(4) 36 

ss = — y v 2 - 

Temperature ^ Zj J-j- 

1 (3799) 2 

= 7^7 « 1738 ) 2 + 0291) 2 + ( 77 °) 2 ] - = 39,118.72 

(3)(4) 36 

^Interaction = - 2) 2 yfj. — — ^Material — SS Tem perature 

1 (3799) 2 

M - [(539) 2 + (229) 2 + • ■ • + (342) 2 ] - ’ - 10,683.72 

4 36 

- 39,118.72 = 9613.78 


SS E = SS T — SS M aterial — ^Temperature — ^Interaction 

= 77,646.97 - 10,683.72 - 39,118.72 - 9613.78 = 18,230.75 

The analysis of variance is shown in Table 5-5 on the next page. Because Fo. 05 , 4,27 = 2.73, 
we conclude that there is a significant interaction between material types and temperature. 
Furthermore, F 0m05X2 7 = 3.35, so the main effects of material type and temperature are 
also significant. Table 5-5 also shows the P-values for the test statistics. 

To assist in interpreting the results of this experiment, it is helpful to construct a 
graph of the average responses at each treatment combination. This graph is shown in 
Figure 5-9 at the bottom the the next page. The significant interaction is indicated by the 
lack of parallelism of the lines. In general, longer life is attained at low temperature, 
regardless of material type. Changing from low to intermediate temperature, battery life 
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Table 5-5 Analysis of Variance for Battery Life Data 


Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

F 0 

E-Value 

Material types 

10,683.72 

2 

5,341.86 

7.91 

0.0020 

Temperature 

39,118.72 

2 

19,559.36 

28.97 

0.0001 

Interaction 

9,613.78 

4 

2,403.44 

3.56 

0.0186 

Error 

18,230.75 

27 

675.21 



Total 

77,646.97 

35 





with material type 3 actually increases, whereas it decreases for types 1 and 2. From 
intermediate to high temperature, battery life decreases for material types 2 and 3 and is 
essentially unchanged for type 1. Material type 3 seems to give the best results if we 
want less loss of effective life as the temperature changes. 


Multiple Comparisons 

When the analysis of variance indicates that row or column means differ, it is usually of 
interest to make comparisons between the individual row or column means to discover 
the specific differences. The multiple comparison methods discussed in Chapter 3 are 
useful in this regard. 

We now illustrate the use of Tukey’s test on the battery life data in Example 5-1. 
Note that in this experiment, interaction is significant. When interaction is significant, 
comparisons between the means of one factor (e.g., A) may be obscured by the AB 
interaction. One approach to this situation is to fix factor B at a specific level and apply 
Tukey’s test to the means of factor A at that level. To illustrate, suppose that in Example 
5-1 we are interested in detecting differences among the means of the three material 
types. Because interaction is significant, we make this comparison at just one level of 
temperature, say level 2 (70°F). We assume that the best estimate of the error variance 
is the MS e from the analysis of variance table, utilizing the assumption that the experi¬ 
mental error variance is the same over all treatment combinations. 





5-3 THE TWO-FACTOR FACTORIAL DESIGN 


183 


The three material type averages at 70°F arranged in ascending order are 

y 12 . = 57.25 (material type 1) 

y 22 . = 119.75 (material type 2) 

y 32 . = 145.75 (material type 3) 

and 

T 0 ,05 = <7o.o S (3, 27) 

= 3.50 
= 45.47 

where we obtained ^o.os(3, 27) — 3.50 by interpolation in Appendix Table VIII. The 
pairwise comparisons yield 

3 vs. 1: 145.75 - 57.25 = 88.50 > T 0 . 0S = 45.47 

3 vs. 2: 145.75 - 119.75 = 26.00 < r 0 . 05 = 45.47 

2 vs. 1: 119.75 - 57.25 = 62.50 > T 0 .o 5 = 45.47 

This analysis indicates that at the temperature level 70°F, the mean battery life is the 
same for material types 2 and 3, and that the mean battery life for material type 1 is 
significantly lower in comparison to both types 2 and 3. 

If interaction is significant, the experimenter could compare all ab cell means to 
determine which ones differ significantly. In this analysis, differences between cell means 
include interaction effects as well as both main effects. In Example 5-1, this would give 
36 comparisons between all possible pairs of the nine cell means. 



Computer Output 

Figure 5-10 (on the next page) presents condensed computer output from Design-Expert 
for the battery life data in Example 5-1. Note that 

SS Mo de , = SS M a,eria. + .^Temperature + SShuemctkm 

= 10,683.72 + 39,118.72 + 9613.78 
= 59,416.22 


and that 


R 2 


■SS Mode i 
SS Total 


59,416.22 

77,646.97 


= 0.7652 


That is, about 77 percent of the variability in the battery life is explained by the plate 
material in the battery, the temperature, and the material type—temperature interaction. 
The residuals from the fitted model are also displayed on the computer output. We now 
show how to use these residuals in model adequacy checking. 
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Response: Life in hours 

AIMOVA for Selected Factorial Model 


Analysis of variance table [Partial sum of squares] 





Sum of 

Mean 


F 



Source 

Squares 

DF Square 

Value Prob 

> F 


Model 

59416.22 

8 7427.03 

11.00 <0.0001 

significant 


A 10683.72 

2 5341.86 

7.91 0.0020 



B 39118.72 

2 19559.36 

28.97 <0.0001 


AB 9613.78 

4 2403.44 

3.56 0.0186 


Residual 

18230.75 

27 675.21 




Lack of Fit 

0.000 

0 





Pure Error 18230.75 

27 675.21 




Cor Total 

77646.97 

35 





Std. Dev. 


25.98 

R-Squared 


0.7652 



Mean 


105.53 

Adj R-Squared 

0.6956 



C.V. 


24.62 

Pred R-Squared 

0.5826 



PRESS 

32410.22 

Adeq Precision 

8.178 



Diagnostics Case Statistics 






Standard 

Actual 

Predicted 



Student 

Cook's 

Outlier 

Order 

Value 

Value 

Residual Leverage Residual 

Distance t 

1 

130.00 

134.75 

-4.75 

0.250 

-0.211 

0.002 

-0.207 

2 

74.00 

134.75 

-60.75 

0.250 

-2.700 

0.270 

-3.100 

3 

155.00 

134.75 

20.25 

0.250 

0.900 

0.030 

0.897 

4 

180.00 

134.75 

45.25 

0.250 

2.011 

0.150 

2.140 

5 

150.00 

155.75 

-5.75 

0.250 

-0.256 

0.002 

-0.251 

6 

159.00 

155.75 

3.25 

0.250 

0.144 

0.001 

0.142 

7 

188.00 

155.75 

32.25 

0.250 

1.433 

0.076 

1.463 

8 

126.00 

155.75 

-29.75 

0.250 

-1.322 

0.065 

-1.341 

9 

138.00 

144.00 

-6.00 

0.250 

-0.267 

0.003 

-0.262 

10 

168.00 

144.00 

24.00 

0.250 

1.066 

0.042 

1.069 

11 

110.00 

144.00 

-34.00 

0.250 

1.511 

0.085 

-1.550 

12 

160.00 

144.00 

16.00 

0.250 

0.711 

0.019 

0.704 

13 

34.00 

57.25 

-23.25 

0.250 

-1.033 

0.040 

-1.035 

14 

80.00 

57.25 

22.75 

0.250 

1.011 

0.038 

1.011 

15 

40.00 

57.25 

-17.25 

0.250 

-0.767 

0.022 

-0.761 

16 

75.00 

57.25 

17.75 

0.250 

0.789 

0.023 

0.783 

17 

136.00 

119.75 

16.25 

0.250 

0.722 

0.019 

0.716 

18 

106.00 

119.75 

-13.75 

0.250 

-0.611 

0.014 

-0.604 

19 

122.00 

119.75 

2.25 

0.250 

0.100 

0.000 

0.098 

20 

115.00 

119.75 

-4.75 

0.250 

-0.211 

0.002 

-0.207 

21 

174.00 

145.75 

28.25 

0.250 

1.255 

0.058 

1.269 

22 

150.00 

145.75 

4.25 

0.250 

0.189 

0.001 

0.185 

23 

120.00 

145.75 

-25.75 

0.250 

-1.144 

0.048 

-1.151 

24 

139.00 

145.75 

-6.75 

0.250 

-0.300 

0.003 

-0.295 

25 

20.00 

57.50 

-37.50 

0.250 

-1.666 

0.103 

-1.726 

26 

82.00 

57.50 

24.50 

0.250 

1.089 

0.044 

1.093 

27 

70.00 

57.50 

12.50 

0.250 

0.555 

0.011 

0.548 

28 

58.00 

57.50 

0.50 

0.250 

0.022 

0.000 

0.022 

29 

25.00 

49.50 

-24.50 

0.250 

-1.089 

0.044 

-1.093 

30 

58.00 

49.50 

8.50 

0.250 

0.378 

0.005 

0.372 

31 

70.00 

49.50 

20.50 

0.250 

0.911 

0.031 

0.908 

32 

45.00 

49.50 

-4.50 

0.250 

-0.200 

0.001 

-0.196 

33 

96.00 

85.50 

10.50 

0.250 

0.467 

0.008 

0.460 

34 

82.00 

85.50 

-3.50 

0.250 

-0.156 

0.001 

-0.153 

35 

104.00 

85.50 

18.50 

0.250 

0.822 

0.025 

0.817 

36 

60.00 

85.50 

-25.50 

0.250 

-1.133 

0.048 

-1.139 


Figure 5-10 Design-Expert output for Example 5-1. 
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5-3.3 Model Adequacy Checking 

Before the conclusions from the analysis of variance are adopted, the adequacy of the 
underlying model should be checked. As before, the primary diagnostic tool is residual 
analysis. The residuals for the two-factor factorial model are 

e iJk = yijk - Sijk (5-11) 

and because the fitted value y iJk = y,y. (the average of the observations in the ijth cell), 
Equation 5-11 becomes 

e ijk = y m - yij. (5-12) 

The residuals from the battery life data in Example 5-1 are shown in the Design- 
Expert computer output (Figure 5-10) and in Table 5-6. The normal probability plot 
of these residuals (Figure 5-11, page 186) does not reveal anything particularly trouble¬ 
some, although the largest negative residual (—60.75 at 15°F for material type 1) 
does stan d out so mewhat from the others. The standardized value of this residual is 
-60.75/V675.21 = —2.34, and this is the only residual whose absolute value is larger 
than 2. 

Figure 5-12 (page 186) plots the residuals versus the fitted values y iJk . This plot 
indicates a mild tendency for the variance of the residuals to increase as the battery life 
increases. Figures 5-13 and 5-14 (page 187) plot the residuals versus material types and 
temperature, respectively. Both plots indicate mild inequality of variance, with the treat¬ 
ment combination of 15°F and material type 1 possibly having larger variance than the 
others. 

From Table 5-6 we see that the 15°F-material type 1 cell contains both extreme 
residuals (-60.75 and 45.25). These two residuals are primarily responsible for the in¬ 
equality of variance detected in Figures 5-12, 5-13, and 5-14. Reexamination of the data 
does not reveal any obvious problem, such as an error in recording, so we accept these 
responses as legitimate. It is possible that this particular treatment combination produces 
slightly more erratic battery life than the others. The problem, however, is not severe 
enough to have a dramatic impact on the analysis and conclusions. 


5-3.4 Estimating the Model Parameters 


The parameters in the effects model for two-factor factorial 

yijk = + Tt + iBj + (tj8 )ij + e lJk (5-13) 


Table 5-6 

Residuals for Example 5-1 








Temperature (°F) 



Type 

15 


70 


125 


1 

-4.75 

20.25 

-23.25 

-17.25 

-37.50 

12.50 


-60.75 

45.25 

22.75 

17.75 

24.50 

0.50 

2 

-5.75 

32.25 

16.25 

2.25 

-24.50 

20.50 


3.25 

-29.75 

-13.75 

-4.75 

8.50 

-4.50 

3 

-6.00 

-34.00 

28.25 

-25.75 

10.50 

18.50 


24.00 

16.00 

4.25 

-6.75 

-3.50 

-25.50 
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-60.75 -34.25 -7.75 18.75 45.25 

Residual 

Figure 5-11 Normal probability plot of residuals for Example 5-1. 


may be estimated by least squares. Because the model has 1 + a + ab parameters to be 
estimated, there are 1 + a + b + ab normal equations. Using the method of Section 3-9, 
it is not difficult to show that the normal equations are 


/ 1 : abnfx + bn t- + an $,• + n X 2 (jPkj = T... 


(5-14a) 
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Figure 5-13 Plot of residuals versus material type for Example 
5-1. 


t,-: bn/l + bnfj + « ^ ft + " E ( T P)y = yt.. i = 1,2 a (5-14b) 

j8 7 : anfL + n ^ t, + an/3, + « 2 (t/3), 7 = j = 1, 2,. .., b (5-14c) 

(TjSV rc/2 + nf, + n/3, + «(Tj8)y = ^ j* = j’ ^.^ (5-14d) 

For convenience, we have shown the parameter corresponding to each normal equation 
on the left in Equations 5-14. 

The effects model (Equation 5-13) is an overparameterized model. Notice that the a 

40 

20 


-60 

-80 

Figure 5-14 Plot of residuals versus temperature for Example 
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equations in Equation 5-14b sum to Equation 5-14a and that the b equations of Equation 
5-14c sum to Equation 5-14a. Also summing Equation 5-14d over j for a particular i will 
give Equation 5-14b, and summing Equation 5-14d over i for a particular j will give 
Equation 5-14c. Therefore, there are a + b + 1 linear dependencies in this system of 
equations and no unique solution will exist. In order to obtain a solution, we impose the 
constraints 


E T = o 

(5-15a) 

'b 


EA = ° 

(5-15b) 

i {Tp), = 0 j = 1 , 2 . b 

(5-15c) 

2 <jP)v = 0 i= 1 , 2 ,..., a 

(5-15d) 


Equations 5-15a and 5-15b constitute two constraints, whereas Equations 5-15c and 
5-15d form a + b - 1 independent constraints. Therefore, we have a + b + 1 total 
constraints, the number needed. 

Applying these constraints, the normal equations (Equations 5-14) simplify consid¬ 
erably, and we obtain the solution 

A = 5L 

ft = y,„ - y... i = 1,2 

b = y.j. ~ 5 L j = i, 2,..., b 

(jP)ij = yn. - y± ~ y.j. + y... = j [2, .,b (5 ' 16) 

Notice the considerable intuitive appeal of this solution to the normal equations. Row 
treatment effects are estimated by the row average minus the grand average; column 
treatments are estimated by the column average minus the grand average; and the z/th 
interaction is estimated by the z/th cell average minus the grand average, the z'th row 
effect, and the y'th column effect. 

Using Equation 5-16, we may find the fitted value y ijk as 

y ijk = A + *, + Pj + (9 h 
= y... + (y,„ - y...) + (y.j. - yJ 
+ ( y v . ~ yi.. ~ y.j. + yJ 
= y.j. 

That is, the kth observation in the z/th cell is estimated by the average of the n observations 
in that cell. This result was used in Equation 5-12 to obtain the residuals for the two- 
factor factorial model. 

Because constraints (Equations 5-15) have been used to solve the normal equations, 
the model parameters are not uniquely estimated. However, certain important functions 
of the model parameters are estimable, that is, uniquely estimated regardless of the 
constraint chosen. An example is t, - t„ + (tj8),. - (t/ 3)„., which might be thought of 
as the “true” difference between the z'th and the zzth levels of factor A. Notice that the 
true difference between the levels of any main effect includes an “average” interaction 
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effect. It is this result that disturbs the tests on main effects in the presence of interaction, 
as noted earlier. In general, any function of the model parameters that is a linear com¬ 
bination of the left-hand side of the normal equations is estimable. This property was 
also noted in Chapter 3 when we were discussing the single-factor model. For more 
information, see the supplemental text material for this chapter. 


5-3.5 Choice of Sample Size 


The operating characteristic curves in Appendix Chart V can be used to assist the ex¬ 
perimenter in determining an appropriate sample size (number of replicates, n) for a two- 
factor factorial design. The appropriate value of the parameter O 2 and the numerator and 
denominator degrees of freedom are shown in Table 5-7. 

A very effective way to use these curves is to find the smallest value of d> 2 corre¬ 
sponding to a specified difference between any two treatment means. For example, if the 
difference in any two row means is D, then the minimum value of d> 2 is 


O 2 = ; 


(5-17) 


whereas if the difference in any two column means is D, then the minimum value of d> 2 


<b 2 


naD 2 
2 b(T 2 


(5-18) 


Finally, the minimum value of d> 2 corresponding to a difference of D between any two 
interaction effects is 


O 2 = 


_ ntf_ _ 

2<r 2 [(a - 1 )(b - 1) + 1] 


(5-19) 


To illustrate the use of these equations, consider the battery life data in Example 
5-1. Suppose that before running the experiment we decide that the null hypothesis should 
be rejected with a high probability if the difference in mean battery life between any two 


Table 5-7 Operating Characteristic Curve Parameters for Chart V of the Appendix 
for the Two-Factor Factorial, Fixed Effects Model 




Numerator 

Denominator 

Factor 

<b 2 

Degrees of Freedom 

Degrees of Freedom 

A 

tail/ 

a - 1 

ab(n - 1) 


ao 2 



B 

° n i& 

b - 1 

ab(n - 1) 


ba 2 



AB 

n i i ml 

(a - 1X6 - 1) 

ab(n - 1) 


o - 2 [(« - 1X6 - 1) + 1] 
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temperatures is as great as 40 hours. Thus D = 40, and if we assume that the standard 
deviation of battery life is approximately 25, then Equation 5-18 gives 


«(3)(40) 2 

2(3)(25) 2 


= 1.28n 


as the minimum value of O 2 . Assuming that a = 0.05, we can now use Appendix Table 
V to construct the following display: 


n 

<h 2 

<h 

v, = Numerator 
Degrees of Freedom 

v 2 = Error Degrees 
of Freedom 

P 

2 

2.56 

1.60 

2 

9 

0.45 

3 

3.84 

1.96 

2 

18 

0.18 

4 

5.12 

2.26 

2 

27 

0.06 


Note that n = 4 replicates give a /3 risk of about 0.06, or approximately a 94 percent 
chance of rejecting the null hypothesis if the difference in mean battery life at any two 
temperature levels is as large as 40 hours. Thus, we conclude that four replicates are 
enough to provide the desired sensitivity as long as our estimate of the standard deviation 
of battery life is not seriously in error. If in doubt, the experimenter could repeat the 
above procedure with other values of a to determine the effect of misestimating this 
parameter on the sensitivity of the design. 


5-3.6 The Assumption of No Interaction in a Two-Factor Model 

Occasionally, an experimenter feels that a two-factor model without interaction is 
appropriate, say 

{ / = 1, 2,a 
j = 1,2 (5-20) 

k = 1, 2,..., n 

We should be very careful in dispensing with the interaction terms, however, because 
the presence of significant interaction can have a dramatic impact on the interpretation 
of the data. 

The statistical analysis of a two-factor factorial model without interaction is straight¬ 
forward. Table 5-8 presents the analysis of the battery life data from Example 5-1, as- 


Table 5-8 Analysis of Variance for Battery Life Data Assuming No Interaction 


Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

F 0 

Material types 

10,683.72 

2 

5,341.86 

5.95 

Temperature 

39,118.72 

2 

19,559.36 

21.78 

Error 

27,844.52 

31 

898.21 


Total 

77,646.96 

35 
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suming that the no-interaction model (Equation 5-20) applies. As noted previously, both 
main effects are significant. However, as soon as a residual analysis is performed for 
these data, it becomes clear that the no-interaction model is inadequate. For the two- 
factor model without interaction, the fitted values are y iJk = y,„ + y. y . — y.„. A plot of 
Jij. - y ijk (the cell averages minus the fitted value for that cell) versus the fitted value y ijk 
is shown in Figure 5-15. Now the quantities y ih — y iJk may be viewed as the differences 
between the observed cell means and the estimated cell means assuming no interaction. 
Any pattern in these quantities is suggestive of the presence of interaction. Figure 5-15 
shows a distinct pattern as the quantities y,y. — y,y* move from positive to negative to 
positive to negative again. This structure is the result of interaction between material 
types and temperature. 


5-3.7 One Observation per Cell 

Occasionally, one encounters a two-factor experiment with only a single replicate, that 
is, only one observation per cell. If there are two factors and only one observation per 
cell, the effects model is 

yij = ti+ 77 + ft + (t/3),j + e 0 - jj Z\[l['"] a b (5-21) 

The analysis of variance for this situation is shown in Table 5-9 on the next page, as¬ 
suming that both factors are fixed. 

From examining the expected mean squares, we see that the error variance cr 2 is not 
estimable; that is, the two-factor interaction effect (r/3),y and the experimental error cannot 
be separated in any obvious manner. Consequently, there are no tests on main effects 
unless the interaction effect is zero. If there is no interaction present, then (tj 3),y = 0 for 
all i and j, and a plausible model is 

*/ = /* + *V + ft + e, jj: \[ 2 t '"[ a b (5-22) 

If the model (Equation 5-22) is appropriate, then the residual mean square in Table 5-9 
is an unbiased estimator of cr 2 , and the main effects may be tested by comparing MS A 
and MS b to MS Residual . 


T 

30 F 



200 


1 

-301- 

Figure 5-15 Plot of y if _ - y.jt versus y ljk , battery life data. 
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Table 5-9 Analysis of Variance for a Two-Factor Model, One Observation per Cell 


Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Square 

Expected Mean 
Square 

Rows (A) 

ii b ab 

a — l 

MS a 



y 2 




Columns (5) 

V 12 - 2k 

p x a ab 

b - 1 

ms b 






, £ 2 (i®S 

Residual or AB 

Subtraction 

(a - l)(b - 1) 

MS Residual 

a + (a - 1 )(b - 1) 

Total 

11/-5 

— 1 




A test developed by Tukey (1949a) is helpful in determining if interaction is present. 
The procedure assumes that the interaction term is of a particularly simple form, namely, 


where y is an unknown constant. By defining the interaction term this way, we may use 
a regression approach to test the significance of the interaction term. The test partitions 
the residual sum of squares into a single-degree-of-freedom component due to nonad¬ 
ditivity (interaction) and a component for error with (a - 1 ){b - 1) - 1 degrees of 
freedom. Computationally, we have 


[S 2 )Wv “ V- + ss b - 


with one degree of freedom, and 


with (a - 1 )(b - 1) - 1 degrees of freedom. To test for the presence of interaction, we 
compute 


i, the hypothesis of no interaction must be rejected. 


The impurity present in a chemical product is affected by two factors—pressure and 
temperature. The data from a single replicate of a factorial experiment are shown in Table 
5-10. The sums of squares are 


b h y ab 
-- \ [23 2 + 13 2 + 8 2 ] - 
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Table 5-10 Impurity Data for Example 5-2 


Temperature 

(°F) 



Pressure 



25 

30 

35 

40 

45 

yu 

100 

5 

4 

6 

3 

5 

23 

125 

3 

1 

4 

2 

3 

13 

150 

1 

1 

3 

1 

2 

8 

y.j 

9 

6 

13 

6 

10 

44 = y 


a j= i ab 

1 44 2 

= - [9 2 + 6 2 + 13 2 + 6 2 + 10 2 ] - — = 11.60 

= 166 - 129.07 = 36.93 


„ = 55 r - SS A - SS B 
= 36.93 - 23.33 - 11.60 = 2.00 


The sum of squares for nonadditivity is computed from Equation 5-23 as follows: 
E E JW,/ = (5X23X9) + (4X23X6) + • • • + (2)(8)(10) = 7236 

[e E iw,/ - y~ (ss A + ss B + j 


abSS A SS B 

[7236 - (44X23.33 + 11.60 + 129.07)] 2 
(3X5X23.33X11.60) 

[20.00] 2 


4059.42 


= 0.0985 


and the error sum of squares is, from Equation 5-24, 

Error = 55R es id U al — SSfj 

= 2.00 - 0.0985 = 1.9015 

The complete analysis of variance is summarized in Table 5-11 on the next page. 
The test statistic for nonadditivity is F 0 = 0.0985/0.2716 = 0.36, so we conclude that 
there is no evidence of interaction in these data. The main effects of temperature and 
pressure are significant. 


In concluding this section, we note that the two-factor factorial model with one 
observation per cell (Equation 5-22) looks exactly like the randomized complete block 
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Table 5-11 Analysis of Variance for Example 5-2 


Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Square 

F 0 

P-Value 

Temperature 

23.33 

2 

11.67 

42.97 

0.0001 

Pressure 

11.60 

4 

2.90 

10.68 

0.0042 

Nonadditivity 

0.0985 

1 

0.0985 

0.36 

0.5674 

Error 

1.9015 

7 

0.2716 



Total 

36.93 

14 





model (Equation 4-1). In fact, the Tukey single-degree-of-freedom test for nonadditivity 
can be directly applied to test for interaction in the randomized block model. However, 
remember that the experimental situations that lead to the randomized block and fac¬ 
torial models are very different. In the factorial model, all ab runs have been made in 
random order, whereas in the randomized block model, randomization occurs only within 
the block. The blocks are a randomization restriction. Hence, the manner in which the 
experiments are run and the interpretation of the two models are quite different. 


5-4 THE GENERAL FACTORIAL DESIGN 


The results for the two-factor factorial design may be extended to the general case where 
there are a levels of factor A, b levels of factor B, c levels of factor C, and so on, arranged 
in a factorial experiment. In general, there will be abc ■ ■ ■ n total observations if there 
are n replicates of the complete experiment. Once again, note that we must have at least 
two replicates (n 5= 2) to determine a sum of squares due to error if all possible inter¬ 
actions are included in the model. 

If all factors in the experiment are fixed, we may easily formulate and test hypotheses 
about the main effects and interactions. For a fixed effects model, test statistics for each 
main effect and interaction may be constructed by dividing the corresponding mean 
square for the effect or interaction by the mean square error. All of these F tests will be 
upper-tail, one-tail tests. The number of degrees of freedom for any main effect is the 
number of levels of the factor minus one, and the number of degrees of freedom for an 
interaction is the product of the number of degrees of freedom associated with the in¬ 
dividual components of the interaction. 

For example, consider the three-factor analysis of variance model: 


you = /r + T,- + + y k + (Tp)ij + (ryk + (Py) jk 


+ {rPyhjk + z.jki 


i = 1, 2,. .., a 
j = 1, 2,..., b 
k= 1,2,..., c 
/ = 1, 2,..., n 


(5-26) 


Assuming that A, B, and C are fixed, the analysis of variance table is shown in Table 
5-12. The F tests on main effects and interactions follow directly from the expected mean 
squares. 



lysis of Variance Table for the Three-Factor Fixed Effects Model 



II II II II II II II 

k° h° b° b? b° h° b° 
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Usually, the analysis of variance computations would be done using a statistics soft¬ 
ware package. However, the manual computing formulas for the sums of squares in Table 
5-12 are occasionally useful. The total sum of squares is found in the usual way as 

< 5 - 27 ) 


The sums of squares for the main effects are found from the totals for factors A(y L J, 
and C(y„*.) as follows: 


ss. - X 2 yf- 

ben Pi 

- y~ 

aben 

(5-28) 

SSb = acn^x y2j " 

_ yi. 

abcn 

(5-29) 

ss c = — 7 - £ 

abn Pi 

_ yi. 

abcn 

(5-30) 


To compute the two-factor interaction sums of squares, the totals for the A X B, A X C, 
and B X C cells are needed. It is frequently helpful to collapse the original data table 
into three two-way tables to compute these quantities. The sums of squares are found 
from 


cn fz i p x abcn 


— SSsubtotalsMfi) SS A SS B 

ss A c = ±± ±yb 


bn ft 
z SSsubtotals(AC) 


abcn 

- SS A - ss c 


- SS A - SSc 


(5-31) 

(5-32) 


and 


SS BC = - £ £ y%. -pp-s 
an Pi Pi abcn 

= 5Ssubtotals(BC) — SS B ~ SS C 


(5-33) 


Note that the sums of squares for the two-factor subtotals are found from the totals in 
each two-way table. The three-factor interaction sum of squares is computed from the 
three-way cell totals { y ljk .} as 

SS ABC = - £ £ £ y?*. - ~ SS A - SS B - SS C - SS AB - SS AC - SS BC (5-34a) 

n i= i j—i k=i abcn 

= SS Suh{Mals(AHC) - SS A - SS B - SSc - SS AB - SS AC - SS BC (5-34b) 

The error sum of squares may be found by subtracting the sum of squares for each main 
effect and interaction from the total sum of squares or by 


SS E = SS T - SS, 


( 5 - 35 ) 
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EXAMPLE 5-3 . 

The Soft Drink Bottling Problem 

A soft drink bottler is interested in obtaining more uniform fill heights in the bottles 
produced by his manufacturing process. The filling machine theoretically fills each bottle 
to the correct target height, but in practice, there is variation around this target, and the 
bottler would like to understand better the sources of this variability and eventually 
reduce it. 

The process engineer can control three variables during the filling process: the per¬ 
cent carbonation (A), the operating pressure in the filler (B), and the bottles produced 
per minute or the line speed (C). The pressure and speed are easy to control, but the 
percent carbonation is more difficult to control during actual manufacturing because it 
varies with product temperature. However, for purposes of an experiment, the engineer 
can control carbonation at three levels: 10, 12, and 14 percent. She chooses two levels 
for pressure (25 and 30 psi) and two levels for line speed (200 and 250 bpm). She decides 
to run two replicates of a factorial design in these three factors, with all 24 runs taken 
in random order. The response variable observed is the average deviation from the target 
fill height observed in a production run of bottles at each set of conditions. The data that 
resulted from this experiment are shown in Table 5-13. Positive deviations are fill heights 
above the target, whereas negative deviations are fill heights below the target. The circled 
numbers in Table 5-13 are the three-way cell totals y ijk _. 

The total corrected sum of squares is found from Equation 5-27 as 

SSt = %%%%*>“ 

(75) 2 

= 571 - K —y- = 336.625 
24 


Table 5-13 Fill Height Deviation Data for Example 5-3 


Operating Pressure (B) 


25 psi 

30 psi 


Line Speed (C) 

Line Speed (C) 

Carbonation (A) 

200 250 200 250 y,_ 

10 I 

12 

14 

i0 -;0 

"O ?© 3 

:® 6© ; 

© :© - 

© 5© 20 

© !!© » 

B X C Totals v.,*. 

6 15 : 

20 34 75 = y.„. 

y.j.. 

21 

54 

A 

X B Totals AX C Totals 


y«.. 

yiJc. 

A 

25 30 A 

200 250 

10 

-5 1 10 

-5 1 

12 

4 16 12 

6 14 

14 

22 37 14 

25 34 
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and the sums of squares for the main effects are calculated from Equations 5-28, 5-29, 
and 5-30 as 

ss C M _ = d -1 A. - -f- 

ben i= i aben 

= j K-4) 2 + <20 f + (59) 2 ] - ^ = 252.750 

SSp™, = — i & - -£=- 

acn y=i aocn 

= 4- [(21) 2 + (54) 2 ] - = 45.375 


i = — E ^ — 

abn k=\ ' ' aben 

= ^ [(26) 2 + (49) 2 ] - ^ = 22.042 


To calculate the sums of squares for the two-factor interactions, we must find the 
two-way cell totals. For example, to find the carbonation-pressure or AB interaction, we 
need the totals for the A X B cells shown in Table 5-13. Using Equation 5-31, we 
find the sums of squares as 


SS AB = - i S y?„ - ~ ss a ~ SS B 

cn "5=1 aben 


1 (75) 2 

~ [(—5) 2 + (l) 2 + (4) 2 + (16) 2 + (22) 2 + (37) 2 ] - 

- 252.750 - 45.375 
= 5.250 


The carbonation-speed or AC interaction uses the A X C cell totals { y Lk .} shown in Table 
5-13 and Equation 5-32: 




= \ [(—5) 2 + (l) 2 + (6) 2 + (14) 2 + (25) 2 + (34) 2 ] - ^ 
- 252.750 - 22.042 


= 0.583 


The pressure-speed or BC interaction is found from the B X C cell totals { y . jk .} shown 
in Table 5-13 and Equation 5-33: 


SS t 


= -'kiyjk.- J jr-ss B -ss c 

an n=i aben 


= ~ [(6) 2 + (15) 2 + (20) 2 + (34) 2 ] - - 45.375 - 22.042 

= 1.042 
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The three-factor interaction sum of squares is found from the A X B X C cell totals 
{ y ijk , \, which are circled in Table 5-13. From Equation 5-34a, we find 

1 a b c 2 

ss ABC = - E 2 2 yfjk. - ^ - ss a - ss B - ss c - ss AB - ss AC - ss BC 

= \ [(-4) 2 + (-1) 2 + (-1) 2 + • • • + (16) 2 + (21 ) 2 ] - 

- 252.750 - 45.375 - 22.042 - 5.250 - 0.583 - 1.042 
= 1.083 

Finally, noting that 

»- a, 328 ^ 

we have 


SS E — SS T SS Subtotals(ABC) 

= 336.625 - 328.125 
= 8.500 

The analysis of variance is summarized in Table 5-14. We see that the percentage 
of carbonation, operating pressure, and line speed significantly affect the fill volume. 
The carbonation-pressure interaction F ratio has a F-value of 0.0558, indicating some 
interaction between these factors. 

The next step should be an analysis of the residuals from this experiment. We leave 
this as an exercise for the reader but point out that a normal probability plot of the 
residuals and the other usual diagnostics do not indicate any major concerns. 

To assist in the practical interpretation of this experiment, Figure 5-16 (page 200) 
presents plots of the three main effects and the AB (carbonation-pressure) interaction. 
The main effect plots are just graphs of the marginal response averages at the levels of 
the three factors. Notice that all three variables have positive main effects; that is, in¬ 
creasing the variable moves the average deviation from the fill target upward. The inter¬ 
action between carbonation and pressure is fairly small, as shown by the similar shape 
of the two curves in Figure 5-l6d. 

Because the company wants the average deviation from the fill target to be close to 
zero, the engineer decides to recommend the low level of operating pressure (25 psi) and 


Table 5-14 Analysis of Variance for Example 5-3 

Sum of Degrees of Mean 

Source of Variation Squares Freedom Square F 0 P-Value 


Percentage of carbonation (A) 252.750 2 

Operating pressure (5) 45.375 1 

Line speed (C) 22.042 1 

AB 5.250 2 

AC 0.583 2 

BC 1.042 1 

ABC 1.083 2 

Error 8.500 12 

Total 336.625 23 


126.375 

45.375 

22.042 

2.625 

0.292 

1.042 

0.542 

0.708 


178.412 

64.059 

31.118 

3.706 

0.412 

1.471 

0.765 


<0.0001 

<0.0001 

0.0001 

0.0558 

0.6713 

0.2485 

0.4867 
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Figure 5-16 Main effects and interaction plots for Example 6-3. (a) 
Percentage of carbonation (A). ( b ) Pressure (B). (c) Line speed (C). (d) 
Carbonation-pressure interaction. 


the high level of line speed (250 bpm, which will maximize the production rate). Figure 
5-17 plots the average observed deviation from the target fill height at the three different 
carbonation levels for this set of operating conditions. Now the carbonation level cannot 
presently be perfectly controlled in the manufacturing process, and the normal distribu¬ 
tion shown with the solid curve in Figure 5-17 approximates the variability in the car- 



Figure 5-17 Average fill height deviation at high speed 
and low pressure for different carbonation levels. 
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bonation levels presently experienced. As the process is impacted by the values of the 
carbonation level drawn from this distribution, the fill heights will fluctuate considerably. 
This variability in the fill heights could be reduced if the distribution of the carbonation 
level values followed the normal distribution shown with the dashed line in Figure 5-17. 
Reducing the standard deviation of the carbonation level distribution was ultimately 
achieved by improving temperature control during manufacturing. 


We have indicated that if all the factors in a factorial experiment are fixed, test 
statistic construction is straightforward. The statistic for testing any main effect or inter¬ 
action is always formed by dividing the mean square for the main effect or interaction 
by the mean square error. However, if the factorial experiment involves one or more 
random factors, the test statistic construction is not always done this way. We must 
examine the expected mean squares to determine the correct tests. We defer a complete 
discussion of experiments with random factors until Chapter 12. 


5-5 FITTING RESPONSE CURVES AND SURFACES 

We have observed that it can be useful to fit a response curve to the levels of a quan¬ 
titative factor so that the experimenter has an equation that relates the response to the 
factor. This equation might be used for interpolation, that is, for predicting the response 
at factor levels between those actually used in the experiment. When at least two factors 
are quantitative, we can fit a response surface for predicting y at various combinations 
of the design factors. In general, linear regression methods are used to fit these models 
to the experimental data. We illustrated this procedure in Section 3-5.1 for an experiment 
with a single factor. We now present two examples involving factorial experiments. We 
will use a computer software package to generate the regression models. For more in¬ 
formation about regression analysis, refer to Chapter 10 and the supplemental text ma¬ 
terial for this chapter. 

EXAMPLE 5-4 . 

Consider the experiment described in Example 5-1. The factor temperature is quantitative 
and material type is qualitative. Furthermore, there are three levels of temperature. Con¬ 
sequently, we can compute a linear and a quadratic temperature effect to study how 
temperature affects the battery life. Table 5-15 (on the next page) presents condensed 
output from Design-Expert for this experiment and assumes that temperature is quanti¬ 
tative and material type is qualitative. 

The analysis of variance in Table 5-15 shows that the “model” source of variability 
has been subdivided into several components. The components “A” and “A 2 ” represent 
the linear and quadratic effects of temperature, and “B” represents the main effect of the 
material type factor. Recall that material type is a qualitative factor with three levels. 
The terms “AB” and “ A 2 B ” are the interactions of the linear and quadratic temperature 
factor with material type. 

The P-values indicate that A 2 and AB are not significant, whereas the A 2 B term is 
significant. Often we think about removing nonsignificant terms or factors from a model 
but in this case, removing A 2 and AB and retaining A 2 B will result in a model that is not 





Table 5-15 Design-Expert Output for Example 5-4 

Response: Life in hr 

ANOVA for Response Surface Reduced Cubic Model 


Analysis of variance table [Partial sum of squares] 





Sum of 

Mean 

F 



Source 

Squares 

DF Square 

Value 1 

Prob > F 


Model 

59416.22 

8 7427.03 

11.00 

<0.0001 significant 

A 

39042.67 

1 39042.67 

57.82 

< 0.0001 


B 

10683.72 

2 5341.86 

7.91 

0.0020 


A 2 

76.06 

1 76.06 

0.11 

0.7398 


AB 

2315.08 

2 1157.54 

1.71 

0.1991 


A 2 B 

7298.69 

2 3649.35 

5.40 

0.0106 


Residual 

18230.75 

27 675.21 




Lack of Fit 

0.000 

0 




Pure Error 

18230.75 

27 675.21 




Cor Total 

77646.97 

35 




Std. Dev. 

25.98 

R-Squared 

0.7652 


Mean 

105.53 

Adj R-Squared 

0.6956 


C.V. 

24.62 

Pred R-Squared 

0.5826 


PRESS 

32410.22 

Adeq Precision 

8.178 



Coefficient 

Standard 

95% Cl 

95% Cl 


Term 

Estimate 

DF Error 

Low 

High 

VIF 

Intercept 

107.58 

1 7.50 

92.19 

122.97 


A-Temp 

-40.33 

1 5.30 

-51.22 

-29.45 

1.00 

B[1] 

-50.33 

1 10.61 

-72.10 

-28.57 


B[2] 

12.17 

1 10.61 

-9.60 

33.93 


A 2 

-3.08 

1 9.19 

-21.93 

15.77 

1.00 

AB[1] 

1.71 

1 7.50 

-13.68 

17.10 


AB[2] 

-12.79 

1 7.50 

-28.18 

2.60 


A 2 B[1] 

41.96 

1 12.99 

15.30 

68.62 


A 2 B[2] 

-14.04 

1 12.99 

-40.70 

12.62 



Final Equation in Terms of Coded Factors: 

Life = 

+ 107.58 
-40.33 *A 
-50.33 *B[1] 

+ 12.17 *B[2] 

-3.08 *A 2 
+ 1.71 *AB[1] 

-12.79 *AB[2] 

+41.96 *A 2 B[1] 

-14.04 *A 2 B[2] 

Final Equation in Terms of Actual Factors: 

Material Type 1 

Life = 

+ 169.38017 

-2.48860 *Temp 
+0.012851 *Temp 2 

Material Type 2 

Life = 

+ 159.62397 

-0.17901 *Temp 
+0.41627 *Temp 2 

Material Type 3 

Life = 

+ 132.76240 
+0.89264 *Temp 
-0.43218 *Temp 2 
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hierarchical. The hierarchy principle indicates that if a model contains a high-order 
term (such as A 2 B), it should also contain all of the lower-order terms that compose it 
(in this case A 2 and AB). Hierarchy promotes a type of internal consistency in a model, 
and many statistical model builders rigorously follow the principal. However, hierarchy 
is not always a good idea, and many models actually work better as prediction equations 
without including the nonsignificant terms that promote hierarchy. For more information, 
see the supplemental text material for this chapter. 

The computer output also gives model coefficient estimates and a final prediction 
equation for battery life in coded factors. In this equation, the levels of temperature are 
A= —1,0, +1 respectively, when temperature is at the low, middle, and high levels (15, 
70, 125°). The variables B[l] and B[ 2] are coded indicator variables that are defined as 
follows: 



Material Type 


1 2 3 

S[l] 

1 0 -1 

B[ 2] 

0 1 -1 


There are also prediction equations for battery life in terms of the actual factor levels. 
Notice that because material type is a qualitative factor there is an equation for predicted 
life as a function of temperature for each material type. Figure 5-18 shows the response 



Temperature 

Figure 5-18 Predicted life as a function of temperature for the three material types. 
Example 5-4. 
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curves generated by these three prediction equations. Compare them to the two-factor 
interaction graph for this experiment in Figure 5-9. 


If several factors in a factorial experiment are quantitative a response surface may 
be used to model the relationship between y and the design factors. Furthermore, the 
quantitative factor effects may be represented by single-degree-of-freedom polynomial 
effects. Similarly, the interactions of quantitative factors can be partitioned into single- 
degree-of-freedom components of interaction. This is illustrated in the following 
example. 

EXAMPLE 5-5 . 

The effective life of a cutting tool installed in a numerically controlled machine is 
thought to be affected by the cutting speed and the tool angle. Three speeds and three 
angles are selected, and a factorial experiment with two replicates is performed. The 
coded data are shown in Table 5-16. The circled numbers in the cells are the cell totals 

{?«.}. 

Condensed Design-Expert output for this example is shown in Table 5-17. The terms 
A and A 2 are the linear and quadratic effects of tool angle, and B and B 2 are the linear 
and quadratic effects of speed. The terms AB, A 2 B, AB 2 , and A 2 B 2 represent linear X 
linear, quadratic X linear, linear X quadratic, and quadratic X quadratic components of 
the two-factor interaction. Although there are some large P-values, all model terms have 
been retained to ensure hierarchy. The prediction equation expressed in coded factors 
uses the levels -1,0, and +1 for A and B to represent the low, middle, and high levels 
of these factors, respectively. 

Figure 5-19 (page 206) presents a contour plot of the surface generated by the pre¬ 
diction equation for tool life. Examination of this response surface indicates that max¬ 
imum tool life is achieved at cutting speeds around 150 rpm and tool angles of 25°. The 
three-dimensional response surface plot in Figure 5-20 (page 206) provides essentially 
the same information, but it provides a different and sometimes more useful perspective 
of the tool life response surface. Exploration of response surfaces is a very important 
aspect of experimental design, which we will discuss in more detail in Chapter 11. 


Table 5-16 

Data for Tool Life Experi 

merit 





Cutting Speed (in/min) 



(degrees) 

125 

150 

175 

y,„ 

15 

:?© 

1© 

IQ 

-l 

20 

°© 

5© 


16 

25 

-J0 

J© 

-°0 

9 

y.j. 

-2 

12 

14 

24 = y... 
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Table 5-17 Design Expert Output for Example 5-5 


Response: Life in Hours 


AIMOVA for Response Surface Reduced Order 4 Model 



Analysis of variance table [Partial sum of squares] 





Sum of 

Mean 

F 



Source 

Squares DF 

Square 

Value Prob > F 



Model 

111.00 8 

13.87 

9.61 0.0013 

significant 

A 

49.00 1 

49.00 

33.92 0.0003 



B 

16.00 1 

16.00 

11.08 0.0088 



A 2 

0.000 1 

0.000 

0.000 1.0000 



B 2 

1.33 1 

1.33 

0.92 0.3618 



AB 

8.00 1 

8.00 

5.54 0.0431 



A 2 B 

2.67 1 

2.67 

1.85 0.2073 



AB 2 

42.67 1 

42.67 

29.54 0.0004 



A 2 B 2 

8.00 1 

8.00 

5.54 0.0431 



Residual 

13.00 9 

1.44 




Lack of Fit 

0.000 0 





Pure Error 

13.00 9 

1.44 




Cor Total 

124.00 17 





Std. Dev. 

1.20 R-Squared 

0.8952 



Mean 

1.33 Adj R-Squared 

0.8020 



C.V. 

90.14 Pred R-Squared 

0.5806 



PRESS 

52.00 Adeq Precision 

8.237 




Coefficient 

Standard 95% Cl 95% Cl 


Factor 

Estimate 

DF Error 

Low 

High 

VIF 

Intercept 

2.00 

1 0.85 

0.078 

3.92 


A-Tool Angle 

3.50 

1 0.60 

2.14 

4.86 

3.00 

B-Speed 

2.00 

1 0.60 

0.64 

3.36 

3.00 

A 2 

0.000 

1 1.04 

-2.35 

2.35 

3.00 

B 2 

1.00 

1 1.04 

-1.35 

3.35 

3.00 

AB 

-1.00 

1 0.42 

-1.96 

0.039 

1.00 

A 2 B 

-1.00 

1 0.74 

-2.66 

0.66 

3.00 

AB 2 

-4.00 

1 0.74 

-5.66 

-2.34 

3.00 

A 2 B 2 

-3.00 

1 1.27 

-5.88 

-0.12 

5.00 


Final Equation in Terms of Coded Factors: 

Life = 

+2.00 
+3.50 *A 
+2.00 *B 
+0.000 *A 2 
+ 1.00 *B 2 
-1.00 *A *B 
-1.00 *A 2 *B 
-4.00 *A *B 2 
-3.00 *A 2 *B 2 

Final Equation in Terms of Actual Factors: 
Life = 

-1068.00000 
+ 136.30000 *Tool Angle 
+ 14.48000 *Speed 
-4.08000 *Tool Angle 2 
-0.049600 *Speed 2 
-1.86400 *Tool Angle *Speed 
+0.056000 *Tool Angle 2 *Speed 
+6.40000E—003 *Tool Angle *Speed 2 
— 1.92000E—004 *Tool Angle 2 *Speed 2 
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5-6 BLOCKING IN A FACTORIAL DESIGN 


We have discussed factorial designs in the context of a completely randomized exper¬ 
iment. Sometimes, it is not feasible or practical to completely randomize all of the runs 
in a factorial. For example, the presence of a nuisance factor may require that the ex¬ 
periment be run in blocks. We discussed the basic concepts of blocking in the context 
of a single-factor experiment in Chapter 4. We now show how blocking can be incor¬ 
porated in a factorial. Some other aspects of blocking in factorial designs are presented 
in Chapters 7, 8, 9, and 13. 

Consider a factorial experiment with two factors (A and B) and n replicates. The 
linear statistical model for this design is 


{ i = 1,2,..., a 
j = 1,2 ,...,b (5-36) 

k = 1,2,..., n 


where 77 , fij, and (r/3) y represent the effects of factors A, B, and the AB interaction, 
respectively. Now suppose that to run this experiment a particular raw material is re¬ 
quired. This raw material is available in batches that are not large enough to allow all 
abn treatment combinations to be run from the same batch. However, if a batch contains 
enough material for ab observations, then an alternative design is to run each of the n 
replicates using a separate batch of raw material. Consequently, the batches of raw ma¬ 
terial represent a randomization restriction or a block, and a single replicate of a complete 
factorial experiment is run within each block. The effects model for this new design is 


yijk = Ti + Pj + (TP)jj + 8 k + € iJk 


i= 1 , 2 ,..., a 
j = 1,2 ,...,b (5-37) 

k= 1,2 


where 8 k is the effect of the Mi block. Of course, within a block the order in which the 
treatment combinations are run is completely randomized. 

The model (Equation 5-37) assumes that interaction between blocks and treatments 
is negligible. This was assumed previously in the analysis of randomized block designs. 
If these interactions do exist, they cannot be separated from the error component. In fact, 
the error term in this model really consists of the (t 8) ik , (j 88) jk , and (Tf38) iJk interactions. 
The analysis of variance is outlined in Table 5-18 on page 208. The layout closely 
resembles that of a factorial design, with the error sum of squares reduced by the sum 
of squares for blocks. Computationally, we find the sum of squares for blocks as the sum 
of squares between the n block totals { y __ k }. 

In the previous example, the randomization was restricted to within a batch of raw 
material. In practice, a variety of phenomena may cause randomization restrictions, such 
as time, operators, and so on. For example, if we could not run the entire factorial 
experiment on one day, then the experimenter could run a complete replicate on day 1, 
a second replicate on day 2, and so on. Consequently, each day would be a block. 


EXAMPLE 5-6 . 

An engineer is studying methods for improving the ability to detect targets on a radar 
scope. Two factors she considers to be important are the amount of background noise, 
or “ground clutter,” on the scope and the type of filter placed over the screen. An 
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Table 5-18 Analysis of Variance for a Two-Factor Factorial in a Randomized Complete Block 


Source of 
Variation 

Sum of Squares 

Degrees of 
Freedom 

Expected 

Mean Square 

F 0 

Blocks 


n - 1 

tr 2 + abcrl 


A 

-1 V j 

bn i abn 

a - 1 

bn'Zrf 

a + T^T 

MS a 

ms e 

B 

JL £ f. - 2=- 

an y y ~ h abn 

b - 1 

an 2 0f 

ms b 

ms e 

AB 


(a - l)(b - 1 ) 

n 2 2 (TfflS 

a + {a - IXb - 1 ) 

ms ab 

ms e 

Error 

Subtraction 

(ab - 1 )(» - 1 ) 

<r 2 


Total 


abn-l 




experiment is designed using three levels of ground clutter and two filter types. We will 
consider these as fixed type factors. The experiment is performed by randomly selecting 
a treatment combination (ground clutter level and filter type) and then introducing a signal 
representing the target into the scope. The intensity of this target is increased until the 
operator observes it. The intensity level at detection is then measured as the response 
variable. Because of operator availability, it is convenient to select an operator and keep 
him or her at the scope until all the necessary runs have been made. Furthermore, op¬ 
erators differ in their skill and ability to use the scope. Consequently, it seems logical to 
use the operators as blocks. Four operators are randomly selected. Once an operator is 
chosen, the order in which the six treatment combinations are run is randomly deter¬ 
mined. Thus, we have a 3 X 2 factorial experiment run in a randomized complete block. 
The data are shown in Table 5-19. 

The linear model for this experiment is 


ym = ii + Ti + fa + + s k + € ijk 


i = 1, 2, 3 

j = 1 . 2 

k = 1, 2, 3, 4 


where t, represents the ground clutter effect, (3 represents the filter type effect, (t/ 3), ; is 
the interaction, S k is the block effect, and e ijk is the NID(0, a 2 ) error component. The 
sums of squares for ground clutter, filter type, and their interaction are computed in the 


Table 5-19 Intensity Level at Target Detection 


Operators (blocks) - - - - 

Filter Type _1_2_1_2_1_2_1_2_ 

Ground clutter 

Low 90 86 96 84 100 92 92 81 

Medium 102 87 106 90 105 97 96 80 

High 114 93 112 91 108 95 98 


83 
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Table 5-20 Analysis of Variance for Example 5-6 


Source of Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

F 0 

F-Value 

Ground clutter (G) 

335.58 

2 

167.79 

15.13 

0.0003 

Filter type (F) 

1066.67 

1 

1066.67 

96.19 

<0.0001 

GF 

77.08 

2 

38.54 

3.48 

0.0573 

Blocks 

402.17 

3 

134.06 



Error 

166.33 

15 

11.09 



Total 

2047.83 

23 





usual manner. The sum of squares due to blocks is found from the operator totals [y„ k 
as follows: 

- ok 1(572)2 + <579)2 + (597)2 + <530>21 - S 


The complete analysis of variance for this experiment is summarized in Table 5-20. 
The presentation in Table 5-20 indicates that all effects are tested by dividing their mean 
squares by the mean square error. Both ground clutter level and filter type are significant 
at the 1 percent level, whereas their interaction is significant only at the 10 percent level. 
Thus, we conclude that both ground clutter level and the type of scope filter used affect 
the operator’s ability to detect the target, and there is some evidence of mild interaction 
between these factors. 


In the case of two randomization restrictions, each with p levels, if the number of 
treatment combinations in a ^-factor factorial design exactly equals the number of re¬ 
striction levels, that is, if p = ab ... m, then the factorial design may be run in a p X p 
Latin square. For example, consider a modification of the radar target detection experi¬ 
ment of Example 5-6. The factors in this experiment are filter type (two levels) and 
ground clutter (three levels), and operators are considered as blocks. Suppose now that 
because of the setup time required, only six runs can be made per day. Thus, days become 
a second randomization restriction, resulting in the 6 X 6 Latin square design, as shown 
in Table 5-21 on the next page. In this table we have used the lowercase letters f and gj 
to represent the ith and y'th levels of filter type and ground clutter, respectively. That is, 
fig 2 represents filter type 1 and medium ground clutter. Note that now six operators are 
required, rather than four as in the original experiment, so the number of treatment 
combinations in the 3 X 2 factorial design exactly equals the number of restriction levels. 
Furthermore, in this design, each operator would be used only once on each day. The 
Latin letters A, B, C, D, E, and F represent the 3 X 2 = 6 factorial treatment combinations 
as follows: A = ftg lt B = f x g 2 , C = f x g 3 , D = E = f?g 2 , and F = f 2 g 3 . 

The five degrees of freedom between the six Latin letters correspond to the main 
effects of filter type (one degree of freedom), ground clutter (two degrees of freedom), 
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Table 5-21 Radar Detection Experiment Run in a 6 X 6 Latin Square 


Day 





Operator 




1 

2 


3 


4 

5 

6 


1 

A(f lg , = 90) 

B(f,g2 = 

106) 

C(f,g 3 = 

108) 

Dtfig, = 81) 

F(f2g 3 = 90) 

E(f2g 2 = 

88) 

2 

C(f,g 3 = 114) 

A(f,g , = 

96) 

B(f,g 2 = 

105) 

F(f2g 3 = 83) 

E(f2g 2 = 86) 

D(f 2gl = 

84) 

3 

B(f,g 2 = 102) 

E(f2g2 = 

90) 

nf2g 3 = 

95) 

A(f,g, = 92) 

D(f2gx = 85) 

C(f ig3 = 

104) 

4 

E(f 2 g 2 = 87) 

D(f*i = 

84) 

A(f igl = 

100) 

B(f ig 2 = 96) 

C(fig 3 = 110) 

F(fig 3 = 

91) 

5 

F(f 2 g 3 = 93) 

C(f lg3 = 

112) 

D(figx = 

92) 

E(f 2 g2 = 80) 

A(f>g, = 90) 

B(f,g 2 = 

98) 

6 

D(fig, = 86) 

F(f2g 3 = 

91) 

E(f2g2 = 

97) 

C(f,g 3 = 98) 

B(f,g2 = 100) 

A(f t gi = 

92) 


and their interaction (two degrees of freedom). The linear statistical model for this design 
is 


ytju = /x + a,- + Tj + /3* + (70)* + 0/ + e,*/ 


'i = 1,2 ,..., 6 
j = 1, 2, 3 
* = 1 , 2 
J= 1 , 2 ,..., 6 


(5-38) 


where r ; and /?* are effects of ground clutter and filter type, respectively, and a, and 6, 
represent the randomization restrictions of days and operators, respectively. To compute 
the sums of squares, the following two-way table of treatment totals is helpful: 


Ground Clutter 

Filter Type 1 

Filter Type 2 

y.J: 

Low 

560 

512 

1072 

Medium 

607 

528 

1135 

High 

646 

543 

1189 

y.. k . 

1813 

1583 

3396 = y..„ 


Furthermore, the row and column totals are 

Rows (y.jki): 563 568 568 568 565 564 

Columns (y iJk .): 572 579 597 530 561 557 

The analysis of variance is summarized in Table 5-22. We have added a column to 
this table indicating how the number of degrees of freedom for each sum of squares is 
determined. 


Table 5-22 Analysis of Variance for the Radar Detection Experiment Run as 
in a Latin Square 


Source of Sum of 

Variation Squares 

Ground clutter, G 571.50 

Filter type, F 1469.44 

GF 126.73 

Days (rows) 4.33 

Operators 428.00 

(columns) 

Error 198.00 

Total 2798.00 


Degrees of 
Freedom 
2 

1 

2 
5 
5 

20 

35 


General Formula 
for Degrees of 
Freedom 


b - 1 

(a - l)(b - 1) 
ab — 1 
ab — 1 

(ab - 1 )(ab - 2) 
(ab) 2 - 1 


Mean 

Square 

285.75 

1469.44 

63.37 

0.87 

85.60 

9.90 


3X2 Factorial 


F 0 P-Value 

28.86 <0.0001 

148.43 <0.0001 

6.40 0.0071 
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5-7 PROBLEMS - 

5-1. The yield of a chemical process is being studied. The two most important variables are 
thought to be the pressure and the temperature. Three levels of each factor are selected, 
and a factorial experiment with two replicates is performed. The yield data follow: 


Temperature (°C) 

Pressure (psig) 

200 

215 

230 

150 

90.4 

90.7 

90.2 


90.2 

90.6 

90.4 

160 

90.1 

90.5 

89.9 


90.3 

90.6 

90.1 

170 

90.5 

90.8 

90.4 


90.7 

90.9 

90.1 


(a) Analyze the data and draw conclusions. Use a = 0.05. 

(b) Prepare appropriate residual plots and comment on the model’s adequacy. 

(c) Under what conditions would you operate this process? 

5-2. An engineer suspects that the surface finish of a metal part is influenced by the feed rate 
and the depth of cut. He selects three feed rates and four depths of cut. He then conducts 
a factorial experiment and obtains the following data: 


Feed Rate (in/min) 


Depth of Cut (in) 


0.15 

0.18 

0.20 

0.25 


74 

79 

82 

99 

0.20 

64 

68 

88 

104 


60 

73 

92 

96 


92 

98 

99 

104 

0.25 

86 

104 

108 

110 


88 

88 

95 

99 


99 

104 

108 

114 

0.30 

98 

99 

110 

111 


102 

95 

99 

107 


(a) Analyze the data and draw conclusions. Use a = 0.05. 

(b) Prepare appropriate residual plots and comment on the model’s adequacy. 

(c) Obtain point estimates of the mean surface finish at each feed rate. 

(d) Find the P-values for the tests in part (a). 

5-3. For the data in Problem 5-2, compute a 95 percent confidence interval estimate of the 
mean difference in response for feed rates of 0.20 and 0.25 in/min. 

5-4. An article in Industrial Quality Control (1956, pp. 5-8) describes an experiment to in¬ 
vestigate the effect of the type of glass and the type of phosphor on the brightness of a 
television tube. The response variable is the current necessary (in microamps) to obtain a 
specified brightness level. The data are as follows: 
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Glass 

Type 

Phosphor Type 


1 

2 

3 


280 

300 

290 

1 

290 

310 

285 


285 

295 

290 


230 

260 

220 

2 

235 

240 

225 


240 

235 

230 


(a) Is there any indication that either factor influences brightness? Use a = 0.05. 

(b) Do the two factors interact? Use a = 0.05. 

(c) Analyze the residuals from this experiment. 

5-5. Johnson and Leone (Statistics and Experimental Design in Engineering and the Physical 
Sciences, Wiley, 1977) describe an experiment to investigate warping of copper plates. 
The two factors studied were the temperature and the copper content of the plates. The 
response variable was a measure of the amount of warping. The data were as follows: 


Temperature (°C) 


Copper Content (%) 


40 

60 

80 

100 

50 

17,20 

16,21 

24,22 

28, 27 

75 

12, 9 

18, 13 

17, 12 

27,31 

100 

16, 12 

18, 21 

25, 23 

30, 23 

125 

21, 17 

23, 21 

23,22 

29,31 


(a) Is there any indication that either factor affects the amount of warping? Is there any 
interaction between the factors? Use a = 0.05. 

(b) Analyze the residuals from this experiment. 

(c) Plot the average warping at each level of copper content and compare them to an 
appropriately scaled t distribution. Describe the differences in the effects of the 
different levels of copper content on warping. If low warping is desirable, what level 
of copper content would you specify? 

(d) Suppose that temperature cannot be easily controlled in the environment in which 
the copper plates are to be used. Does this change your answer for part (c)? 

5-6. The factors that influence the breaking strength of a synthetic fiber are being studied. Four 
production machines and three operators are chosen and a factorial experiment is run 
using fiber from the same production batch. The results are as follows: 


Machine 


Operator 12 3 4 


2 

3 


109 

110 
110 
112 
116 
114 


110 

115 

110 

111 

112 

115 


108 

109 

111 

109 

114 

119 


110 

108 

114 

112 

120 

117 
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(a) Analyze the data and draw conclusions. Use a= 0.05. 

(b) Prepare appropriate residual plots and comment on the model’s adequacy. 

5-7. A mechanical engineer is studying the thrust force developed by a drill press. He suspects 
that the drilling speed and the feed rate of the material are the most important factors. He 
selects four feed rates and uses a high and low drill speed chosen to represent the extreme 
operating conditions. He obtains the following results. Analyze the data and draw con¬ 
clusions. Use a = 0.05. 


Drill Speed 


Feed Rate 


0.015 

0.030 

0.045 

0.060 

125 

2.70 

2.45 

2.60 

2.75 


2.78 

2.49 

2.72 

2.86 

200 

2.83 

2.85 

2.86 

2.94 


2.86 

2.80 

2.87 

2.88 


An experiment is conducted to study the influence of operating temperature and t 
types of face-plate glass in the light output of an oscilloscope tube. The following 
are collected: 



Temperature 


Glass Type 

100 

125 

150 


580 

1090 

1392 

1 

568 

1087 

1380 


570 

1085 

1386 


550 

1070 

1328 

2 

530 

1035 

1312 


579 

1000 

1299 


546 

1045 

867 

3 

575 

1053 

904 


599 

1066 

889 


(a) Use a = 0.05 in the analysis. Is there a significant interaction effect? Does glass 
type or temperature affect the response? What conclusions can you draw? 

(b) Fit an appropriate model relating light output to glass type and temperature. 

(c) Analyze the residuals from this experiment. Comment on the adequacy of the models 
you have considered. 

5-9. Consider the experiment in Problem 5-1. Fit an appropriate model to the response data. 

Use this model to provide guidance concerning operating conditions for the process. 
5-10. Use Tukey’s test to determine which levels of the pressure factor are significantly different 
for the data in Problem 5-1. 
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5-11. An experiment was conducted to determine if either firing temperature or furnace position 
affects the baked density of a carbon anode. The data are shown below: 


Position 


Temperature (°C) 


800 

825 

850 


570 

1063 

565 

1 

565 

1080 

510 


583 

1043 

590 


528 

988 

526 

2 

547 

1026 

538 


521 

1004 

532 


Suppose we assume that no interaction exists. Write down the statistical model. Conduct 
the analysis of variance and test hypotheses on the main effects. What conclusions can be 
drawn? Comment on the model’s adequacy. 

5-12. Derive the expected mean squares for a two-factor analysis of variance with one obser¬ 
vation per cell, assuming that both factors are fixed. 

5-13. Consider the following data from a two-factor factorial experiment. Analyze the data and 
draw conclusions. Perform a test for nonadditivity. Use a = 0.05. 


Column Factor 

Row Factor 12 3 4 

1 36 39 36 32 

2 18 20 22 20 

3 30 37 33 34 


5-14. The shear strength of an adhesive is thought to be affected by the application pressure 
and temperature. A factorial experiment is performed in which both factors are assumed 
to be fixed. Analyze the data and draw conclusions. Perform a test for nonadditivity. 


Pressure 

(lb/in 2 ) 


Temperature (°F) 


250 

260 

270 

120 

9.60 

11.28 

9.00 

130 

9.69 

10.10 

9.57 

140 

8.43 

11.01 

9.03 

150 

9.98 

10.44 

9.80 


5-15. Consider the three-factor model 

f i = 1, 2,..., a 

yijk = M + T, + Pj + y* + (TP)ij + (fiy) jk + e ijk < j - 1, 2,..., b 
U = 1, 2,..., c 

Notice that there is only one replicate. Assuming all the factors are fixed, write down the 
analysis of variance table, including the expected mean squares. What would you use as 
the “experimental error” to test hypotheses? 
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5-16. The percentage of hardwood concentration in raw pulp, the vat pressure, and the cooking 
time of the pulp are being investigated for their effects on the strength of paper. Three 
levels of hardwood concentration, three levels of pressure, and two cooking times are 
selected. A factorial experiment with two replicates is conducted, and the following data 
are obtained: 


Percentage of 
Hardwood 
Concentration 

Cooking Time 3.0 Hours 

Cooking Time 4.0 Hours 


Pressure 



Pressure 


400 

500 

650 

400 

500 

650 

2 

196.6 

197.7 

199.8 

198.4 

199.6 

200.6 


196.0 

196.0 

199.4 

198.6 

200.4 

200.9 

4 

198.5 

196.0 

198.4 

197.5 

198.7 

199.6 


197.2 

196.9 

197.6 

198.1 

198.0 

199.0 

8 

197.5 

195.6 

197.4 

197.6 

197.0 

198.5 


196.6 

196.2 

198.1 

198.4 

197.8 

199.8 


(a) Analyze the data and draw conclusions. Use a = 0.05. 

(b) Prepare appropriate residual plots and comment on the model’s adequacy. 

(c) Under what set of conditions would you operate this process? Why? 

5-17. The quality control department of a fabric finishing plant is studying the effect of several 
factors on the dyeing of cotton-synthetic cloth used to manufacture men’s shirts. Three 
operators, three cycle times, and two temperatures were selected, and three small speci¬ 
mens of cloth were dyed under each set of conditions. The finished cloth was compared 
to a standard, and a numerical score was assigned. The results follow. Analyze the data 
and draw conclusions. Comment on the model’s adequacy. 


Temperature 


Cycle Time 


300° 



350° 



Operator 



Operator 


1 

2 

3 

1 

2 

3 


23 

27 

31 

24 

38 

34 

40 

24 

28 

32 

23 

36 

36 


25 

26 

29 

28 

35 

39 


36 

34 

33 

37 

34 

34 

50 

35 

38 

34 

39 

38 

36 


36 

39 

35 

35 

36 

31 


28 

35 

26 

26 

36 

28 

60 

24 

35 

27 

29 

37 

26 


27 

34 

25 

25 

34 

24 


5-18. In Problem 5-1, suppose that we wish to reject the null hypothesis with a high probability 
if the difference in the true mean yield at any two pressures is as great as 0.5. If a 
reasonable prior estimate of the standard deviation of yield is 0.1, how many replicates 
should be run? 

5-19. The yield of a chemical process is being studied. The two factors of interest are temper¬ 
ature and pressure. Three levels of each factor are selected; however, only nine runs can 
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be made in one day. The experimenter runs a complete replicate of the design on each 
day. The data are shown in the following table. Analyze the data, assuming that the days 
are blocks. 


Temperature 


Day 1 
Pressure 



Day 2 
Pressure 


250 

260 

270 

250 

260 

270 

Low 

86.3 

84.0 

85.8 

86.1 

85.2 

87.3 

Medium 

88.5 

87.3 

89.0 

89.4 

89.9 

90.3 

High 

89.1 

90.2 

91.3 

91.7 

93.2 

93.7 


5-20. Consider the data in Problem 5-5. Analyze the data, assuming that replicates are blocks. 

5-21. Consider the data in Problem 5-6. Analyze the data, assuming that replicates are blocks. 

5-22. An article in the Journal of Testing and Evaluation (Vol. 16, no. 2, pp. 508-515) inves¬ 
tigated the effects of cyclic loading and environmental conditions on fatigue crack growth 
at a constant 22 MPa stress for a particular material. The data from this experiment are 
shown below (the response is crack growth rate): 




Environment 


Frequency 

Air 

H 2 0 

Salt H 2 0 


2.29 

2.06 

1.90 

10 

2.47 

2.05 

1.93 

2.48 

2.23 

1.75 


2.12 

2.03 

2.06 


2.65 

3.20 

3.10 


2.68 

3.18 

3.24 

1 

2.06 

3.96 

3.98 


2.38 

3.64 

3.24 


2.24 

11.00 

9.96 

0.1 

2.71 

11.00 

10.01 

2.81 

9.06 

9.36 


2.08 

11.30 

10.40 


(a) Analyze the data from this experiment (use a = 0.05). 

(b) Analyze the residuals. 

(c) Repeat the analyses from parts (a) and (b) using In (y) as the response. Comment on 
the results. 

5-23. An article in the IEEE Transactions on Electron Devices (Nov. 1986, pp. 1754) describes 
a study on polysilicon doping. The experiment shown below is a variation of their study. 
The response variable is base current. 


Polysilicon 
Doping (ions) 


Anneal Temperature (°C) 

900 

950 

1000 

1 X 10 20 

4.60 

4.40 

10.15 

10.20 

11.01 

10.58 

2 X 10 20 

3.20 

3.50 

9.38 

10.02 

10.81 

10.60 
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(a) Is there evidence (with a = 0.05) indicating that either polysilicon doping level or 
anneal temperature affect base current? 

(b) Prepare graphical displays to assist in interpretation of this experiment. 

(c) Analyze the residuals and comment on model adequacy. 

(d) Is the model 

y = Po + + /3 2 X 2 + 022*2 + 012*1*2 + e 

supported by this experiment (jc, = doping level, x 2 = temperature)? Estimate the 
parameters in this model and plot the response surface. 


The 2 k Factorial 
Design _ 


6-1 INTRODUCTION 

Factorial designs are widely used in experiments involving several factors where it is 
necessary to study the joint effect of the factors on a response. Chapter 5 presented 
general methods for the analysis of factorial designs. However, there are several special 
cases of the general factorial design that are important because they are widely used in 
research work and also because they form the basis of other designs of considerable 
practical value. 

The most important of these special cases is that of k factors, each at only two levels. 
These levels may be quantitative, such as two values of temperature, pressure, or time; 
or they may be qualitative, such as two machines, two operators, the “high” and “low” 
levels of a factor, or perhaps the presence and absence of a factor. A complete replicate 
of such a design requires 2X2X---X2 = 2* observations and is called a 2 k factorial 
design. 

This chapter focuses on this extremely important class of designs. Throughout this 
chapter we assume that (1) the factors are fixed, (2) the designs are completely random¬ 
ized, and (3) the usual normality assumptions are satisfied. 

The 2 k design is particularly useful in the early stages of experimental work, when 
there are likely to be many factors to be investigated. It provides the smallest number of 
runs with which k factors can be studied in a complete factorial design. Consequently, 
these designs are widely used in factor screening experiments. 

Because there are only two levels for each factor, we assume that the response is 
approximately linear over the range of the factor levels chosen. In many factor screening 
experiments, when we are just starting to study the process or system, this is often a 
reasonable assumption. In Section 6-6, we will present a simple method for checking 
this assumption, and discuss what action to take if it is violated. 
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6-2 THE 2 2 DESIGN 

The first design in the 2 k series is one with only two factors, say A and B, each run at 
two levels. This design is called a 2 2 factorial design. The levels of the factors may be 
arbitrarily called “low” and “high.” As an example, consider an investigation into the 
effect of the concentration of the reactant and the amount of the catalyst on the conversion 
(yield) in a chemical process. Let the reactant concentration be factor A, and let the two 
levels of interest be 15 and 25 percent. The catalyst is factor B, with the high level 
denoting the use of 2 pounds of the catalyst and the low level denoting the use of only 
1 pound. The experiment is replicated three times, and the data are as follows: 


A B 

Treatment 

Combination 


Replicate 


Total 

I 

II 

III 

- 

A low, B low 

28 

25 

27 

80 

+ - 

A high, B low 

36 

32 

32 

100 

+ 

A low, B high 

18 

19 

23 

60 

+ + 

A high, B high 

31 

30 

29 

90 


The treatment combinations in this design are shown graphically in Figure 6-1. By 
convention, we denote the effect of a factor by a capital Latin letter. Thus “A” refers to 
the effect of factor A, “B” refers to the effect of factor B, and “AB” refers to the AB 
interaction. In the 2 2 design the low and high levels of A and B are denoted by “ — ” and 
“+, ” respectively, on the A and B axes. Thus, - on the A axis represents the low level 
of concentration (15%), whereas + represents the high level (25%), and - on the B axis 
represents the low level of catalyst, whereas + denotes the high level. 

The four treatment combinations in the design are usually represented by lowercase 
letters, as shown in Figure 6-1. We see from the figure that the high level of any factor 
in the treatment combination is denoted by the corresponding lowercase letter and that 
the low level of a factor in the treatment combination is denoted by the absence of the 


b = 60 ab = 90 

High (18+ 19 + 23) (31 +30 + 29) 

(2 pounds) T 1 

n 

e | 

< s 

(1 pound) (1) = 80 a = 100 

(28 + 25 + 27) (36 + 32 + 32) 


Low High 

(15%) (25%) 


Reactant 

concentration, 

A 


Figure 6-1 Treatment combinations in the 2 2 design. 
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corresponding letter. Thus, a represents the treatment combination of A at the high level 
and B at the low level, b represents A at the low level and B at the high level, and ab 
represents both factors at the high level. By convention, (1) is used to denote both factors 
at the low level. This notation is used throughout the 2 k series. 

In a two-level factorial design, we may define the average effect of a factor as the 
change in response produced by a change in the level of that factor averaged over the 
levels of the other factor. Also, the symbols (1 ), a, b, and ab now represent the total of 
all n replicates taken at the treatment combination, as illustrated in Figure 6-1. Now the 
effect of A at the low level of B is [a - (1)]/« and the effect of A at the high level of B 
is [ab ~b]/n. Averaging these two quantities yields the main effect of A: 

A = ±- {[ab — b\ + [a — (1)]} 

In 

= ^-[ab + a-b- (1)] (6-1) 

2 n 

The average main effect of B is found from the effect of B at the low level of A (i.e., 
[b - (l)]/«) and at the high level of A (i.e., [ab — a]/n) as 

B = ±- {[ab -a] + [b- (1)]} 

In 

= ^[ab + b- a- (1)] (6-2) 

In 

We define the interaction effect AB as the average difference between the effect of 
A at the high level of B and the effect of A at the low level of B. Thus, 

AB = Yn {[ab ~ b] ~ [a ~ (1)]} 

= ^ [ab + (1) - a - b] (6-3) 

Alternatively, we may define AB as the average difference between the effect of B 
at the high level of A and the effect of B at the low level of A. This will also lead to 
Equation 6-3. 

The formulas for the effects of A, B, and AB may be derived by another method. 
The effect of A can be found as the difference in the average response of the two treatment 
combinations on the right-hand side of the square in Figure 6-1 (call this average y A +, 
because it is the average response at the treatment combinations where A is at the high 
level) and the two treatment combinations on the left-hand side (or y A ~). That is, 

A = y A + - y A - 

_ ab + a b + (1) 

2n 2 n 

= ^-[ab + a- b- (1)] 

In 

This is exactly the same result as in Equation 6-1. The effect of B, Equation 6-2, is 
found as the difference between the average of the two treatment combinations on the 
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top of the square (y B +) and the average of the two treatment combinations on the bottom 
or 


1 = Tfl + - y B - 
_ ab + b _ 
~ ~2n 

1 


Finally, the interaction effect AB is the average of the right-to-left diagonal treatment 
combinations in the square [ab and (1)] minus the average of the left-to-right diagonal 
treatment combinations (a and b), or 

AB _ ab + (1) _ a + b 
2 n In 

= ^ [ab + (1) - a - b ] 

In 


which is identical to Equation 6-3. 

Using the experiment in Figure 6-1, we may estimate the average effects as 
A = (90 + 100 - 60 - 80) = 8.33 

B = 2^3) (9 ° + 60 " 100 ~ 80) = _5 '°° 

AB - ^ (90 + 80 - 100 - 60) = 1.67 

The effect of A (reactant concentration) is positive; this suggests that increasing A from 
the low level (15%) to the high level (25%) will increase the yield. The effect of B 
(catalyst) is negative; this suggests that increasing the amount of catalyst added to the 
process will decrease the yield. The interaction effect appears to be small relative to the 
two main effects. 

In many experiments involving 2 k designs, we will examine the magnitude and 
direction of the factor effects to determine which variables are likely to be important. 
The analysis of variance can generally be used to confirm this interpretation. There are 
several excellent statistics software packages that are useful for setting up and analyzing 
2 k designs. There are also special time-saving methods for performing the calculations 
manually. 

Consider the sums of squares for A, B, and AB. Note from Equation 6-1 that a 
contrast is used in estimating A, namely 

Contrast = ab + a - b — (1) (6-4) 

We usually call this contrast the total effect of A. From Equations 6-2 and 6-3, we see 
that contrasts are also used to estimate B and AB. Furthermore, these three contrasts are 
orthogonal. The sum of squares for any contrast can be computed from Equation 3-29, 
which states that the contrast sum of squares is equal to the contrast squared divided by 
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the number of observations in each total in the contrast times the sum of the squares of 
the contrast coefficients. Consequently, we have 


SS A 

SS B 


[ab + a - b - (l)] 2 
An 

[ab + b - a - (l)] 2 
An 


(6-5) 

( 6 - 6 ) 


and 


SS AB 


[ab + (1) - a - bf 
An 


(6-7) 


as the sums of squares for A, B, and AB. 

Using the experiment in Figure 6-1, we may find the sums of squares from Equations 
6-5, 6-6, and 6-7 as 


SS A 

SS B 


(50) 2 = 
4(3) 

(—30) 2 

4(3) 


( 6 - 8 ) 


and 


. (10) 2 . 
' 4(3) ' 


The total sum of squares i: 


SS T - E E E ' 


(6-9) 


In general, SS T has An - 1 degrees of freedom. The error sum of squares, with 4 (n - 1) 
degrees of freedom, is usually computed by subtraction as 

SS E = SS T - SS A - SS B - SS AB (6-10) 


For the experiment in Figure 6-1, we obtain 


2 2 3 V 2 

ss T = 2 2 2 ym ~ ^ 

= 9398.00 - 9075.00 = 323.00 


and 


SS E = SS r - SS A ~ SS B - SS AB 

= 323.00 - 208.33 - 75.00 - 8.33 
= 31.34 

using SS A , SS B , and SS AB from Equations 6-8. The complete analysis of variance is 
summarized in Table 6-1. Based on the F-values, we conclude that the main effects are 
statistically significant and that there is no interaction between these factors. This con¬ 
firms our initial interpretation of the data based on the magnitudes of the factor effects. 
It is often convenient to write down the treatment combinations in the order (1), a, 
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Table 6-1 

Analysis of Variance for the Experiment in Figure 6-1 


Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

F 0 

P-Value 

A 

208.33 

1 

208.33 

53.15 

0.0001 

B 

75.00 

1 

75.00 

19.13 

0.0024 

AB 

8.33 

1 

8.33 

2.13 

0.1826 

Error 

31.34 

8 

3.92 



Total 

323.00 

11 





b, ab. This is referred to as standard order (or Yates’ order, for Dr. Frank Yates). Using 
this standard order, we see that the contrast coefficients used in estimating the effects are 

Effects 

(1) 

a 

b ab 

A: 

-1 

+ 1 

-1 +1 

B : 

-1 

-1 

+ 1 +1 

AB: 

+ 1 

-1 

-1 +1 


Note that the contrast coefficients for estimating the interaction effect are just the product 
of the corresponding coefficients for the two main effects. The contrast coefficient is 
always either +1 or -1, and a table of plus and minus signs such as in Table 6-2 can 
be used to determine the proper sign for each treatment combination. The column head¬ 
ings in Table 6-2 are the main effects (A and B), the AB interaction, and I, which rep¬ 
resents the total or average of the entire experiment. Notice that the column corresponding 
to / has only plus signs. The row designators are the treatment combinations. To find the 
contrast for estimating any effect, simply multiply the signs in the appropriate column 
of the table by the corresponding treatment combination and add. For example, to esti¬ 
mate A, the contrast is — (1) + a — b + ab, which agrees with Equation 6-1. 

The Regression Model 

In a 2 k factorial design, it is easy to express the results of the experiment in terms of a 
regression model. Because the 2 k is just a factorial design, we could also use either an 
effects or a means model, but the regression model approach is much more natural and 
intuitive. For the chemical process experiment in Figure 6-1, the regression model is 

y = A) + 01*1 + 02*2 + € 

where x, is a coded variable that represents the reactant concentration and x 2 is a coded 
variable that represents the amount of catalyst and the /3’s are regression coefficients. 


Table 6-2 Algebraic Signs for Calculating Effects 
in the 2 2 Design 


Treatment 

Combination 


Factorial Effect 

I A B AB 


( 1 ) 

b 


ab 
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The relationship between the natural variables, the reactant concentration and the 
amount of catalyst, and the coded variables is 

_ Cone - (Conci ow + Conc high )/2 
Xl (Conchigh - Conc low )/2 

and 

_ Catalyst - (Catalyst low + Catalyst high )/2 
(Catalysthigh - Catalyst, ow )/2 

When the natural variables have only two levels, this coding will produce the familiar 
± 1 notation for the levels of the coded variables. To illustrate this for our example, note 
that 

= Cone - (15 + 25)/2 
~~ (25 - 15)/2 

_ Cone - 20 
5 

Thus, if the concentration is at the high level (Cone = 25%), then x x = +1; if the 
concentration is at the low level (Cone = 15%), then x x = -1. Furthermore, 

_ Catalyst - (1 + 2)/2 
* 2 “ (2 - l)/2 

_ Catalyst -1.5 
0.5 


Thus, if the catalyst is at the high level (Catalyst = 2 pounds), then x 2 = +1; if the 
catalyst is at the low level (Catalyst = 1 pound), then x 2 = -1. 

The fitted regression model is 

/ 8.33\ /—5.00\ 

j - 27 - 5+ (—)*'♦!—j* 

where the intercept is the grand average of all 12 observations, and the regression co¬ 
efficients pi and p 2 are one-half the corresponding factor effect estimates. The reason 
that the regression coefficient is one-half the effect estimate is that a regression coefficient 
measures the effect of a unit change in x on the mean of y, and the effect estimate is 
based on a two-unit change (from -1 to +1). We will show later that this simple method 
of estimating the regression coefficients is producing least squares parameter estimates. 
Also see the supplemental material for this chapter. 


Residuals and Model Adequacy 

The regression model can be used to obtain the predicted or fitted value of y at the four 
points in the design. The residuals are the differences between the observed and fitted 
values of y. For example, when the reactant concentration is at the low level (x ] = -1) 
and the catalyst is at the low level ( x 2 = -1), the predicted yield is 


= 25.835 
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There are three observations at this treatment combination, and the residuals are 

e l = 28 - 25.835 = 2.165 
e 2 = 25 - 25.835 = -0.835 
e 3 = 27 - 25.835 = 1.165 

The remaining predicted values and residuals are calculated similarly. For the high level 
of the reactant concentration and the low level of the catalyst, 

, = 27.5 + (f)( + l, + (^)(-., 

= 34.165 
and 

e 4 = 36 - 34.165 = 1.835 
e s = 32 - 34.165 = -2.165 
e 6 = 32 — 34.165 = -2.165 

For the low level of the reactant concentration and the high level of the catalyst, 

* = 2 7 .5 + (f)(-l> + (^°)< + „ 

= 20.835 
and 

e 7 = 18 - 20.835 = -2.835 
e 8 = 19 - 20.835 = -1.835 
e 9 = 23 - 20.835 = 2.165 

Finally, for the high level of both factors, 

, = 27, + (f)c +I > + (^°°) (+] , 

= 29.165 
and 

e,o = 31 - 29.165 = 1.835 
<?„ = 30 - 29.165 = 0.835 
e 12 = 29 - 29.165 = -0.165 

Figure 6-2 (page 226) presents a normal probability plot of these residuals and a plot of 
the residuals versus the predicted yield. These plots appear satisfactory, so we have no 
reason to suspect problems with the validity of our conclusions. 


The Response Surface 
The regression model 


„ /8.33\ /—5.00\ 

y = 27 - 5 + {—) x ' + \—)^ 
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can be used to generate response surface plots. If it is desirable to construct these plots 
in terms of the natural factor levels, then we simply substitute the relationships between 
the natural and the coded variables that we gave earlier into the regression model, yielding 

>-”*(?)(“) 

= 18.33 + 0.8333Conc - 5.00Catalyst 

Figure 6-3a presents the three-dimensional response surface plot of yield from this 
model, and Figure 6-3 b is the contour plot. Because the model is first-order (that is, it 
contains only the main effects), the fitted response surface is a plane. From examining 
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(6) Contour plot 

Figure 6-3 Response surface plot and contour plot of yield from the chemical 
process experiment. 


the contour plot, we see that yield increases as reactant concentration increases and 
catalyst amount decreases. Often, we use a fitted surface such as this to find a direction 
of potential improvement for a process. A formal way to do this, called the method of 
steepest ascent, will be presented in Chapter 11 when we discuss methods for system¬ 
atically exploring response surfaces. 
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6-3 THE 2 3 DESIGN 

Suppose that three factors, A, B, and C, each at two levels, are of interest. The design is 
called a 2 3 factorial design and the eight treatment combinations can now be displayed 
geometrically as a cube, as shown in Figure 6-Aa. Using the “+ and — ” notation to 
represent the low and high levels of the factors, we may list the eight runs in the 2 3 
design as in Figure 6-4 b. This is sometimes called the design matrix. Extending the 
label notation discussed in Section 6-2, we write the treatment combinations in standard 
order as (1), a, b, ab, c, ac, be, and abc. Remember that these symbols also represent 
the total of all n observations taken at that particular treatment combination. 

There are actually three different notations that are widely used for the runs in the 2 k 
design. The first is the + and - notation, often called the geometric notation. The 
second is the use of lowercase letter labels to identify the treatment combinations. The 
final notation uses 1 and 0 to denote high and low factor levels, respectively, instead of 
+ and -. These different notations are illustrated below for the 2 3 design: 


Run 

A 

B 

C 

Labels 

A 

B 

c 

1 

- 

- 

- 

(1) 

0 

0 

0 

2 

+ 

- 

- 

a 

1 

0 

0 

3 

- 

+ 

- 

b 

0 

1 

0 

4 

+ 

+ 

- 

ab 

1 

1 

0 

5 

- 

- 

+ 

c 

0 

0 

1 

6 

+ 

- 

+ 

ac 

1 

0 

1 

7 

- 

+ 

+ 

be 

0 

1 

1 

8 

+ 

+ 

+ 

abc 

1 

1 

1 


There are seven degrees of freedom between the eight treatment combinations in the 
2 3 design. Three degrees of freedom are associated with the main effects of A, B, and C. 
Four degrees of freedom are associated with interactions; one each with AB, AC, and BC 
and one with ABC. 

Consider estimating the main effects. First, consider estimating the main effect A. 
The effect of A when B and C are at the low level is [a - (l)]/n. Similarly, the effect of 
A when B is at the high level and C is at the low level is [ab — b]/n. The effect of A 



(a) Geometric view (6) The design matrix 

Figure 6-4 The 2 3 factorial design. 




6-3 THE V DESIGN 229 


when C is at the high level and B is at the low level is [ac — c]/n. Finally, the effect of 
A when both B and C are at the high level is [abc — bc]/n. Thus, the average effect of 
A is just the average of these four, or 

A — [a — (1) + ab — b + ac - c + abc - be] (6-11) 

4 n 

This equation can also be developed as a contrast between the four treatment com¬ 
binations in the right face of the cube in Figure 6-5 a (where A is at the high level) and 
the four in the left face (where A is at the low level). That is, the A effect is just the 
average of the four runs where A is at the high level (y A +) minus the average of the four 
runs where A is at the low level (y A ~), or 

A = y A + ~ y A - 

_ a + ab + ac + abc (1 ) + b + c + be 

An An 

This equation can be rearranged as 

A = ^ [a + ab + ac + abc - (1) - b - c - be] 
which is identical to Equation 6-11. 



AB AC BC 

ib) Two-factor interaction 



ABC 


(c) Three-factor interaction 

Figure 6-5 Geometric presentation of contrasts corresponding to the main effects 
and interactions in the 2 3 design. 
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In a similar manner, the effect of B is the difference in averages between the four 
treatment combinations in the front face of the cube and the four in the back. This yields 

B =y B + ~ yB- 

= — [b + ab + be + abc - (1 ) — a — c - ac] (6-12) 

An 

The effect of C is the difference in averages between the four treatment combinations in 
the top face of the cube and the four in the bottom, that is, 

C = y c * ~ yc- 

= — [c + ac + be + abc - (1) — a - b - ab] (6-13) 

An 

The two-factor interaction effects may be computed easily. A measure of the AB 
interaction is the difference between the average A effects at the two levels of B. By 
convention, one-half of this difference is called the AB interaction. Symbolically, 

B _ Average A Effect _ 

High (+) [(abc - be) + (ab - b)] 

2n 

Low (-) [(ac - c) + [a - (1)]} 

2 n 

Difference [abc — be + ab — b — ac + c — a + (1)] 

In 


Because the AB interaction is one-half of this difference, 

M = [abc - be + ab - b-ac + c- a + (1)] ^ ^ 

An 

We could write Equation 6-14 as follows: 

abc + ab + c + ( 1) be + b + ac + a 

AB = ---^- 

An An 

In this form, the AB interaction is easily seen to be the difference in averages between 
runs on two diagonal planes in the cube in Figure 6-5 b. Using similar logic and referring 
to Figure 6-5 b, the AC and BC interactions are 

AC = — [(1) - a + b-ab — c + ac — be + abc] (6-15) 

An 

and 


BC = — [(1) + a - b - ab - c - ac + be + abc] 


( 6 - 16 ) 
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The ABC interaction is defined as the average difference between the AB interaction 
for the two different levels of C. Thus, 


As before, we can think of the ABC interaction as the difference in two averages. If the 
runs in the two averages are isolated, they define the vertices of the two tetrahedra that 
comprise the cube in Figure 6-5c. 

In Equations 6-11 through 6-17, the quantities in brackets are contrasts in the treat¬ 
ment combinations. A table of plus and minus signs can be developed from the contrasts 
and is shown in Table 6-3. Signs for the main effects are determined by associating a 
plus with the high level and a minus with the low level. Once the signs for the main 
effects have been established, the signs for the remaining columns can be obtained by 
multiplying the appropriate preceding columns, row by row. For example, the signs in 
the AB column are the product of the A and B column signs in each row. The contrast 
for any effect can be obtained easily from this table. 

Table 6-3 has several interesting properties: (1) Except for column /, every column 
has an equal number of plus and minus signs. (2) The sum of the products of the signs 
in any two columns is zero. (3) Column / multiplied times any column leaves that column 
unchanged. That is, I is an identity element. (4) The product of any two columns yields 
a column in the table. For example, AX B = AB, and 

AB X B = AB 2 = A 


We see that the exponents in the products are formed by using modulus 2 arithmetic. 
(That is, the exponent can only be 0 or 1; if it is greater than 1, it is reduced by multiples 
of 2 until it is either 0 or 1.) All of these properties are implied by the orthogonality of 
the contrasts used to estimate the effects. 

Sums of squares for the effects are easily computed, because each effect has a cor¬ 
responding single-degree-of-freedom contrast. In the 2 3 design with n replicates, the sum 
of squares for any effect is 


SS = 


(Contrast) 2 
8 n 


(6-18) 


Table 6-3 Algebraic Signs for Calculating Effects in the 2 3 Design 
Treatm^nf Factorial Effect 


BC ABC 


be 
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EXAMPLE 6-1 . 

Recall Example 5-3, which presented a study on the effect of percentage of carbonation, 
operating pressure, and line speed on the fill height of a carbonated beverage. Suppose 
that only two levels of carbonation are used so that the experiment is a 2 3 factorial design 
with two replicates. The data, deviations from the target fill height, are shown in Table 
6-4, and the design is shown geometrically in Figure 6-6. 

Using the totals under the treatment combinations shown in Table 6-4, we may 
estimate the factor effects as follows: 

A = — \a — (1) + ab — b + ac — c + abc — be] 

An 

= \u~ (-4) + 5 - (-1) + 3 - (-1) +11-2] 

= | [24] = 3.00 

B = — [b + ab + be +abc - (1) - a - c - ac] 

An 

= | [-1 +5 + 2 + 11— (-4) - 1 - (-1) - 3] 

= \ US] = 2.25 

C = — [c + ac + be + abc - (1) - a - b - ab] 

An 

[-1 + 3 + 2 + 11 - (-4) - 1 - (-1) - 5] 

= i [14] = 1.75 

AB = — [ab - a - b + (1) + abc - be - ac + c] 

An 

= | [5 - 1 - (-1) + (-4) + 11 - 2 - 3 + (-1)] 

= | [6] = 0.75 


Table 6-4 The Fill Height Experiment, Example 6-1 


Run 

Coded Factors 

Fill Height Deviation 

Factor Levels 

A 

B 

C 

Replicate 1 

Replicate 2 

Low (—1) 

High (+1) 

1 

-1 

-1 

-1 

-3 

-1 

A (%) 10 

12 

2 

1 

-1 

-1 

0 

1 

B (psi) 25 

30 

3 

-1 

1 

-1 

-1 

0 

C (b/min) 200 

250 

4 

1 

1 

-1 

2 

3 



5 

-1 

-1 

1 

-1 

0 



6 

1 

-1 

1 

2 

1 



8 

1 

1 

1 

6 

5 
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Figure 6-6 The 2 3 design for the fill height deviation experiment 
for Example 6-1. 


AC — — [(1) — a + b — ab — c + ac — be + abc ] 

= i I -4 “ 1 + (-1) ~ 5 “ ( -1 ) + 3-2+11] 
= l [2] = 0.25 

BC = — [(1) + a- b-ab-c - ac + be + abc ] 

= J [-4 + 1 - (-1) - 5 - (-1) -3 + 2+11] 
= | [4] = 0.50 


and 


A5C = ^ [a&c -ftc-ac + c-afe + 6 + a- (1)] 

= | [11 - 2 - 3 + (-1) - 5 + (-1) + 1 - (-4)] 
= l [4] = 0.50 


The largest effects are for carbonation (A = 3.00), pressure (B = 2.25), speed (C = 1.75) 
and the carbonation-pressure interaction (AB = 0.75), although the interaction effect 
does not appear to have as large an impact on fill height deviation as the main effects. 
The sums of squares are calculated from Equation 6-18 as follows: 


SS A = 
SS B = 
SS C = 


(24 ) 2 
16 

(18) 2 

16 

(14) 2 

16 


36.00 

20.25 

12.25 
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Table 6-5 Effect Estimate Summary for Example 64 


Factor 

Effect 

Estimate 

Sum of 
Squares 

Percent 

Contribution 

A 

3.00 

36.00 

46.1538 

B 

2.25 

20.25 

25.9615 

C 

1.75 

12.25 

15.7051 

AB 

0.75 

2.25 

2.88462 

AC 

0.25 

0.25 

0.320513 

BC 

0.50 

1.00 

1.28205 

ABC 

0.50 

1.00 

1.28205 

Pure Error 


5.00 

6.41026 

Total 


78.00 



SS AC = ^ = 0.25 
(4) 2 

SS BC = ^= 1.00 

and 

SS ABC = ^- = 1.00 

The total sum of squares is SS T = 78.00, and by subtraction, SS E = 5.00. Table 6-5 
summarizes the effect estimates and sums of squares. The column labeled “percent 
contribution” measures the percentage contribution of each model term to the total sum 
of squares. The percentage contribution is often a rough but effective guide to the relative 
importance of each model term. Note that the main effects really dominate this process, 
accounting for over 87 percent of the total variability, whereas the AB interaction ac¬ 
counts for less than 3 percent. 

The analysis of variance in Table 6-6 may be used to confirm the magnitude of these 
effects. From Table 6-6 we note that the main effects are highly significant (all have very 


Table 6-6 Analysis of Variance for the Fill Height Data 


Source of 

Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Square 

F 0 

P-Value 

Percent carbonation (A) 

36.00 

1 

36.00 

57.60 

<0.0001 

Pressure (S) 

20.25 

1 

20.25 

32.40 

0.0005 

Line speed (C) 

12.25 

1 

12.25 

19.60 

0.0022 

AB 

2.25 

1 

2.25 

3.60 

0.0943 

AC 

0.25 

1 

0.25 

0.40 

0.5447 

BC 

1.00 

1 

1.00 

1.60 

0.2415 

ABC 

1.00 

1 

1.00 

1.60 

0.2415 

Error 

5.00 

8 

0.625 



Total 

78.00 

15 
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small P-values). The AB interaction is significant at about the 10 percent level; thus there 
is some mild interaction between carbonation and pressure. 

The reader may wish to refer to Example 5-3 for the practical interpretation of this 
experiment. The process developers decided to run the process at low pressure and high 
line speed and to reduce the variability in carbonation by controlling the temperature 
more precisely. This resulted in a substantial reduction in the deviation of fill height from 
the target value. 


The Regression Model and Response Surface 

The regression model for predicting fill height deviation is 

f = 00 + Pl x l + 02*2 + 03*3 + 012*1*2 

where the coded variables x u x 2 , and x 3 represent A, B, and C, respectively. The x x x 2 
term is the AB interaction. Residuals can be obtained as the difference between observed 
and predicted fill height deviations. We leave the analysis of these residuals as an exercise 
for the reader. 

Figure 6-7 (page 236) presents the response surface and contour plot for fill height 
deviation obtained from the regression model, assuming that line speed is at the high 
level (x 3 = 1). Notice that because the model contains interaction, the contour lines of 
constant fill height deviation are curved (or the response surface is a “twisted” plane). 
It is desirable to operate this filling process so that fill deviation is as close to zero as 
possible. The contour plot shows that if line speed is at the high level, then there are 
several combinations of pressure and carbonation level that will satisfy this objective. 
However, it will be necessary to control both of these variables very precisely. 

Computer Solution 

There are many statistics software packages that will set up and analyze two-level fac¬ 
torial designs. The output from one of these computer programs, Design-Expert, is shown 
in Table 6-7 (pages 237, 238). In the upper part of the table an analysis of variance for 
the full model is presented. The format of this presentation is somewhat different from 
the results given in Table 6-6. Notice that the first line of the analysis of variance is an 
overall summary for the full model (all main effects and interactions), and the model 
sum of squares is 

SS Modc , = SS A + SS B + SSc + SS AB + ss AC + SS BC + ss ABC 
= 73.0 


Thus the statistic 


F 0 


MS Modcl 

ms e 


is testing the hypotheses 

Ho-Pi = 02 = 03 = 012 = 013 = 023 = 0123 = 0 
Hi : at least one ft + 0 






Table 6-7 Design-Expert Output for Example 6-1 

Response: Fill Deviation in Height 

AIMOVA for Selected Factorial Model 


Analysis of variance table [Partial sum of squares] 



Sum of 


Mean 

F 


Source 

Squares 

DF 

Square 

Value 

Prob > F 

Model 

73.00 

7 

10.43 

16.69 

0.0003 

A 

36.00 

1 

36.00 

57.60 

<0.0001 

B 

20.25 

1 

20.25 

32.40 

0.0005 

C 

12.25 

1 

12.25 

19.60 

0.0022 

AB 

2.25 

1 

2.25 

3.60 

0.0943 

AC 

0.25 

1 

0.25 

0.40 

0.5447 

BC 

1.00 

1 

1.00 

1.60 

0.2415 

ABC 

1.00 

1 

1.00 

1.60 

0.2415 

Residual 

5.00 

8 

0.63 



Lack of Fit 

0.000 

0 




Pure Error 

5.00 

8 

0.63 



Cor Total 

78.00 

15 





significant 


Std. Dev. 
Mean 
C.V. 
PRESS 


0.79 R-Squared 

1.00 Adj R-Squared 

79.06 Pred R-Squared 

20.00 Adeq Precision 


0.9359 

0.8798 

0.7436 

13.416 


Factor 

Intercept 

A-Carbonation 

B-Pressure 

C-Speed 

AB 

AC 

BC 

ABC 


Coefficient 

Estimate 

1.00 

1.50 

1.13 


0.13 

0.25 

0.25 


Standard 
DF Error 

1 0.20 

1 0.20 

1 0.20 

1 0.20 

1 0.20 

1 0.20 

1 0.20 

1 0.20 


95% Cl 95% Cl 
Low High 

0.54 1.46 

1.04 1.96 

0.67 1.58 

0.42 1.33 

-0.081 0.83 

-0.33 0.58 

-0.21 0.71 

-0.21 0.71 


VIF 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 


Final Equation in Terms of Coded Factors: 

Fill Deviation = 

+ 1.00 
+ 1.50 "A 
+ 1.13 *B 
+0.88 *C 
+0.38 *A *B 
+0.13 "A *C 
+0.25 *B "C 
+0.25 "A *B "C 


Final Equation in Terms of Actual Factors: 

Fill Deviation = 

-225.50000 

+21.00000 *Carbonation 
+7.80000 "Pressure 
+ 1.08000 *Speed 
-0.75000 *Carbonation "Pressure 
-0.10500 "Carbonation "Speed 
-0.040000 *Pressure *Speed 
+4.00000E-00: *Carbonation "Pressure "Speed 


Reduced Model: 

Response: Fill Deviation in Height 

ANOVA for Selected Factorial Model 
Analysis of variance table [Partial sum of squares] 
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Table 6-7 (continued) 



Sum of 


Mean 

F 



Source 

Squares 

DF 

Square 

Value 

Prob > F 


Model 

70.75 

4 

17.69 

26.84 

<0.0001 

significant 

A 

36.00 

1 

36.00 

54.62 

<0.0001 


B 

20.25 

1 

20.25 

30.72 

0.0002 


C 

12.25 

1 

12.25 

18.59 

0.0012 


AB 

2.25 

1 

2.25 

3.41 

0.0917 


Residual 

7.25 

11 

0.66 




Lack of Fit 

2.25 

3 

0.75 

1.20 

0.3700 

not significant 

Pure Error 

5.00 

8 

0.63 




Cor Total 

78.00 

15 






Std. Dev. 
Mean 
C.V. 
PRESS 


0.81 R-Squared 

1.00 Adj R-Squared 

81.18 Pred R-Squared 

15.34 Adeq Precision 


0.9071 

0.8733 

0.8033 

15.424 


Coefficient 

Factor Estimate DF 

Intercept 1.00 1 

A-Carbonation 1.50 1 

B-Pressure 1.13 1 

C-Speed 0.88 1 

AB 0.38 1 


Error 

0.20 

0.20 

0.20 

0.20 

0.20 


95% Cl 
Low 

0.55 

1.05 

0.68 

0.43 

-0.072 


95% Cl 
High 
1.45 
1.95 
1.57 
1.32 
0.82 


VIF 

1.00 

1.00 

1.00 

1.00 


Final Equation in Terms of Coded Factors: 

Fill Deviation = 

+1.00 
+ 1.50 *A 
+ 1.13 *B 
+0.88 *C 
+0.38 *A *B 


Final Equation in Terms of Actual Factors: 

Fill Deviation = 

+9.62500 

-2.62500 *Carbonation 
-1.20000 ‘Pressure 
+0.035000 *Speed 
+0.15000 ‘Carbonation ‘Pressure 


Diagnostics Case Statistics 
Standard Actual Predicted 
Order Value Value 

1 -3.00 -2.13 

2 -1.00 -2.13 

3 0.000 0.12 

4 1.00 0.12 

5 -1.00 -0.63 

6 0.000 -0.63 

7 2.00 3.13 

8 3.00 3.13 

9 -1.00 -0.37 

10 0.000 -0.37 

11 2.00 1.88 

12 1.00 1.88 

13 1.00 1.13 

14 1.00 1.13 

15 6.00 4.88 

16 5.00 4.88 


Student 

Residual Leverage Residual 

-0.88 0.313 -1.300 

1.13 0.313 1.671 

-0.12 0.313 -0.186 

0.88 0.313 1.300 

-0.38 0.313 -0.557 

0.63 0.313 0.928 

-1.13 0.313 -1.671 

-0.13 0.313 -0.186 

-0.63 0.313 -0.928 

0.37 0.313 0.557 

0.13 0.313 0.186 

-0.88 0.313 -1.300 

-0.13 0.313 -0.186 

-0.13 0.313 -0.186 

1.13 0.313 1.671 

0.13 0.313 0.186 


Cook's Outlier 

Distance t 

0.154 -1.347 

0.254 1.845 

0.003 -0.177 

0.154 1.347 

0.028 -0.539 

0.078 0.922 

0.254 -1.845 

0.003 -0.177 

0.078 -0.922 

0.028 0.539 

0.003 0.177 

0.154 -1.347 

0.003 -0.177 

0.003 -0.177 

0.254 1.845 

0.003 0.177 


238 



6-3 THE 2 3 DESIGN 


239 


Because F 0 is large, we would conclude that at least one variable has a nonzero effect. 
Then each individual factorial effect is tested for significance using the F statistic. These 
results agree with Table 6-6. 

Below the full model analysis of variance, several R 2 statistics are presented. The 
ordinary R 2 is 


R 2 


^Model 

‘S'&rotal 


73.00 

78.00 


and it measures the proportion of total variability explained by the model. A potential 
problem with this statistic is that it always increases as factors are added to the model, 
even if these factors are not significant. The adjusted R 2 statistic, defined as 


R 


2 

Adj 


ss E /df E 

ss Tot jdf Tc 


5.00/8 

78.00/15 


= 0.8798 


is a statistic that is adjusted for the “size” of the model; that is, the number of factors. 
The adjusted R 2 can actually decrease if nonsignificant terms are added to a model. The 
PRESS statistic is a measure of how well the model will predict new data (PRESS is 
actually an acronym for Prediction Error Sum of Squares, and it is computed from the 
prediction errors obtained by predicting the ith data point with a model that includes all 
observations except the /th one.). A model with a small value of PRESS indicates that 
the model is likely to be a good predictor. The “Prediction R 2 ” statistic is computed as 


This indicates that the full model would be expected to explain about 74 percent of the 
variability in new data. 

The next portion of the output presents the regression coefficient for each model 
term and the standard error of each coefficient, defined as 


«0S) = W(fo = 


I MSe 
V nl k 



0.20 


The 95 percent confidence intervals on each regression coefficient are computed from 


j8 - t amxN - p se{P) ^0^/3 + t 0 .025,N- P se(P) 

where the degrees of freedom on t are the number of degrees of freedom for error; that 
is, N is the total number of runs in the experiment (16), and p is the number of model 
parameters (8). The full model in terms of both the coded variables and the natural 
variables is also presented. 

The last part of the display in Table 6-7 illustrates the output following removal of 
the nonsignificant interaction terms. This reduced model now contains only the main 
effects A, B, and C, and the AB interaction. The error or residual sum of squares is now 
composed of a pure error component arising from the replication of the eight comers 
of the cube, and a lack of fit component consisting of the sums of squares for the 
interactions that were dropped from the model (BC, AC, and ABC). Once again, the 
regression model representation of the experimental results is given in terms of both 
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coded and natural variables. The proportion of total variability in fill height deviation 
that is explained by this model is 


R 2 


SS Madcl 

SSfotal 


which is smaller than the R 2 for the full model. Notice, however, that the adjusted R 2 for 
the reduced model has changed only slightly from the adjusted R 2 for the full model, 
and PRESS for the reduced model is considerably smaller, leading to a larger value of 
Rp red for the reduced model. Clearly removing the nonsignificant terms from the full 
model has produced a final model that is likely to function more effectively as a predictor 
of new data. Notice that the confidence intervals on the regression coefficients for the 
reduced model are slightly shorter than the corresponding confidence intervals for the 
full model. 

The last part of the output presents the residuals from the reduced model. Design- 
Expert will also construct all of the residual plots that we have previously discussed. 


Other Methods for Judging the Significance of Effects 

The analysis of variance is a formal way to determine which factor effects are nonzero. 
There are several other methods that are useful. Below, we show how to calculate the 
standard error of the effects and use these standard errors to construct confidence 
intervals on the effects. Another method, which we will illustrate in Section 6-5, uses 
normal probability plots to assess the importance of the effects. 

The standard error of an effect is easy to find. If we assume that there are n replicates 
at each of the 2 k runs in the design, and if y lu y a , Vm are the observations at the z'th 

run, then 


S 2 = -J— 2 (y, - yf i= 12,..., 2* 
n — l j=i 


is an estimate of the variance at the z'th run. The 2 k variance estimates can be combined 
to give an overall variance estimate: 


2 \n - 1) 


2 2 <y v ~ yi) 2 


(6-19) 


This is also the variance estimate given by the error mean square in the analysis of 
variance. The variance of each effect estimate is 


T(Effect) = V 


{ Contrast\ 

V " 2*- 1 ) 


' (n2 k 'f 


Each contrast is a linear combination of 2 k treatment totals, and each total consists of n 
observations. Therefore, 


^(Contrast) = n2 k a 2 
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and the variance of an effect is 

^Effect) = -i- 7-3 n2V 2 
(«2 ) 



2a 

Vri2 k 


The estimated standard error would be found by taking the square root of this last ex¬ 
pression and replacing a 2 by its estimate S 2 : 


se(Effect) 


2 S 


( 6 - 20 ) 


Notice that the standard error of an effect is twice the standard error of an estimated 
regression coefficient in the regression model for the 2 k design (see the Design-Expert 
computer output for Example 6-1). 

The 100(1 - a) percent confidence intervals on the effects are computed from 
Effect ± fa^.w-p^Effect), where the degrees of freedom on t are just the error or residual 
degrees of freedom (N — p = total number of runs — number of model parameters). 

To illustrate this method, consider the fill height deviation experiment in Example 
6-1. The mean square error is MS E = 0.625. Therefore, the standard error of each effect 
is (using S 2 = MS E ) 

re(Effect) = 


V20 5 ) 

0.40 


Now to. 025,8 = 2.31 and i 0 . 025,8 ^(Effect) = 2.31(0.40) = 0.92, so approximate 95 
percent confidence intervals on the factor effects are 


A 

3.00 ± 0.92 

B 

2.25 ± 0.92 

C 

1.75 ± 0.92 

AB 

0.75 ± 0.92 

AC 

0.25 ± 0.92 

BC 

0.50 ± 0.92 

ABC 

0.50 ± 0.92 


This analysis indicates that A, B, and C are important factors, because they are the only 
factor effect estimates for which the approximate 95 percent confidence intervals do not 
include zero. 


Dispersion Effects 

The process engineer working on the filling process was also interested in dispersion 
effects; that is, do any of the factors affect variability in fill height deviation from run 
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Figure 6-8 Ranges of fill height deviation for Example 6-1. 


to run? One way to answer the question is to look at the range of fill height deviations 
for each of the eight runs in the 2 3 design. These ranges are plotted on the cube in Figure 
6-8. Notice that the ranges are about the same for all eight runs in the design. Conse¬ 
quently, there is no strong evidence indicating that some of the process variables directly 
affect the variability in fill height deviation in the process. 


6-4 THE GENERAL 2 k DESIGN 

The methods of analysis that we have presented thus far may be generalized to the case 
of a 2* factorial design, that is, a design with k factors each at two levels. The statistical 
model for a 2* design would include k main effects, (£) two-factor interactions, (3) three- 
factor interactions,..., and one ^-factor interaction. That is, for a 2* design the complete 
model would contain 2 k - 1 effects. The notation introduced earlier for treatment com¬ 
binations is also used here. For example, in a 2 s design abd denotes the treatment com¬ 
bination with factors A, B, and D at the high level and factors C and E at the low level. 

The treatment combinations may be written in standard order by introducing the factors 
one at a time, with each new factor being successively combined with those that precede j 
it. For example, the standard order for a 2 4 design is (1), a, b, ab, c, ac, be, abc, d, ad, 
bd, abd, cd, acd, bed, and abed. I 

The general approach to the statistical analysis of the 2* design is summarized in 
Table 6-8. As we have indicated previously, a computer software package is usually 
employed in this analysis process. 

The sequence of steps in Table 6-8 should, by now, be familiar. The first step is to 
estimate factor effects and examine their signs and magnitudes. This gives the experi- j 

Table 6-8 Analysis Procedure 
for a 2 k Design 

1. Estimate factor effects 

2. Form initial model 

3. Perform statistical testing 

4. Refine model 

5. Analyze residuals 

6. Interpret results 
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menter preliminary information regarding which factors and interactions may be impor¬ 
tant, and in which directions these factors should be adjusted to improve the response. 
In forming the initial model for the experiment, we usually choose the full model, that 
is, all main effects and interactions, provided that at least one of the design points has 
been replicated (in the next section, we discuss a modification to this step). Then in step 
3, we use the analysis of variance to formally test for significance of main effects and 
interaction. Table 6-9 shows the general form of an analysis of variance for a 2 k factorial 
design with n replicates. Step 4, refine the model, usually consists of removing any 
nonsignificant variables from the full model. Step 5 is the usual residual analysis to check 
for model adequacy and to check assumptions. Sometimes model refinement will occur 
after residual analysis, if we find that the model is inadequate or assumptions are badly 
violated. The final step usually consists of graphical analysis—either main effect or 
interaction plots, or response surface and contour plots. 

Although the calculations described above are almost always done with a computer, 
occasionally it is necessary to manually calculate an effect estimate or sum of squares 
for an effect. To estimate an effect or to compute the sum of squares for an effect, we 
must first determine the contrast associated with that effect. This can always be done by 
using a table of plus and minus signs, such as Table 6-2 or Table 6-3. However, for large 
values of k this is awkward, and we can use an alternate method. In general, we determine 
the contrast for effect AB • • ■ K by expanding the right-hand side of 

Contrast,,*,..* = (a ± l)(b ± 1) • • • (k ± 1) (6-21) 


Table 6-9 Analysis of Variance for a 2 k Design 
Source of Sum of 

Variation Squares 


Degrees of 
Freedom 


(2) two-factor interactions 


(*) three-factor interactions 
ABC 
ABD 


(I) = 1 ^-factor interaction 
ABC ■■■K 
Error 
Total 
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In expanding Equation 6-21, ordinary algebra is used with “1” being replaced by (1) in 
the final expression. The sign in each set of parentheses is negative if the factor is included 
in the effect and positive if the factor is not included. 

To illustrate the use of Equation 6-21, consider a 2 3 factorial design. The contrast 
for AB would be 

Contrasts = (a - 1 )(b - l)(c + 1) 

= abc + ab + c + (1) — ac — be — a - b 

As a further example, in a 2 5 design, the contrast for ABCD would be 

Contrast*^ = (a - 1 )(b - l)(c - 1 )(d - l)(e + 1) 

= abede + ede +bde + ade + bee 
+ ace + abe + e + abed + cd + bd 
+ ad + be + ac + ab + (1) - a — b - c 

— abc — d — abd — acd — bed — ae 

— be — ce — abce — de — abde — aede — bede 

Once the contrasts for the effects have been computed, we may estimate the effects 
and compute the sums of squares according to 

2 

AB ■ • • K = —r (Contrast*^..*) (6-22) 

nl 

and 

SS AB ... K = (Contrast*^..*) 2 (6-23) 

respectively, where n denotes the number of replicates. There is also a tabular algorithm 
due to Dr. Frank Yates that can occasionally be useful for manual calculation of the 
effect estimates and the sums of squares. Refer to the supplemental text material for this 
chapter. 


6-5 A SINGLE REPLICATE OF THE 2 k DESIGN 

For even a moderate number of factors, the total number of treatment combinations in a 
2 k factorial design is large. For example, a 2 5 design has 32 treatment combinations, a 
2 6 design has 64 treatment combinations, and so on. Because resources are usually lim¬ 
ited, the number of replicates that the experimenter can employ may be restricted. Fre¬ 
quently, available resources only allow a single replicate of the design to be run, unless 
the experimenter is willing to omit some of the original factors. 

An obvious risk when conducting an experiment that has only one run at each test 
combination is that we may be fitting a model to noise. That is, if the response y is highly 
variable, misleading conclusions may result from the experiment. The situation is illus¬ 
trated in Figure 6-9 a. In this figure, the straight line represents the true factor effect. 
However, because of the random variability present in the response variable (represented 
by the shaded band), the experimenter actually obtains the two measured responses rep¬ 
resented by the dark dots. Consequently, the estimated factor effect is close to zero, and 
the experimenter has reached an erroneous conclusion concerning this factor. Now if 
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(a) Small distance between factor levels 



Factor, * 

(6) Aggressive spacing of factor levels 
Figure 6-9 The impact of the choice of factor levels in an unreplicated 
design. 

there is less variability in the response, the likelihood of an erroneous conclusion will be 
smaller. Another way to ensure that reliable effect estimates are obtained is to increase 
the distance between the low (—) and high (+) levels of the factor, as illustrated in Figure 
6-9 b. Notice that in this figure, the increased distance between the low and high factor 
levels results in a reasonable estimate of the true factor effect. 

The single-replicate strategy is often used in screening experiments when there are 
relatively many factors under consideration. Because we can never be entirely certain in 
such cases that the experimental error is small, a good practice in these types of exper¬ 
iments is to spread out the factor levels aggressively. You might find it helpful to re-read 
the guidance on choosing factor levels in Chapter 1. 

A single replicate of a 2 k design is sometimes called an unreplicated factorial. With 
only one replicate, there is no internal estimate of error (or “pure error”). One approach 
to the analysis of an unreplicated factorial is to assume that certain high-order interactions 
are negligible and combine their mean squares to estimate the error. This is an appeal to 
the sparsity of effects principle; that is, most systems are dominated by some of the 
main effects and low-order interactions, and most high-order interactions are negligible. 
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When analyzing data from unreplicated factorial designs, occasionally real high- 
order interactions occur. The use of an error mean square obtained by pooling high-order 
interactions is inappropriate in these cases. A method of analysis attributed to Daniel 
(1959) provides a simple way to overcome this problem. Daniel suggests examining a 
normal probability plot of the estimates of the effects. The effects that are negligible 
are normally distributed, with mean zero and variance a 2 and will tend to fall along a 
straight line on this plot, whereas significant effects will have nonzero means and will 
not lie along the straight line. Thus the preliminary model will be specified to contain 
those effects that are apparently nonzero, based on the normal probability plot. The 
apparently negligible effects are combined as an estimate of error. 

EXAMPLE 6-2 . 

A Single Replicate of the 2 4 Design 

A chemical product is produced in a pressure vessel. A factorial experiment is carried 
out in the pilot plant to study the factors thought to influence the filtration rate of this 
product. The four factors are temperature (A), pressure (B), concentration of formalde¬ 
hyde (C), and stirring rate (£>). Each factor is present at two levels. The design matrix 
and the response data obtained from a single replicate of the 2 4 experiment are shown 
in Table 6-10 and Figure 6-10. The 16 runs are made in random order. The process 
engineer is interested in maximizing the filtration rate. Current process conditions give 
filtration rates of around 75 gal/h. The process also currently uses the concentration of 
formaldehyde, factor C, at the high level. The engineer would like to reduce the form¬ 
aldehyde concentration as much as possible but has been unable to do so because it 
always results in lower filtration rates. 

We will begin the analysis of this data by constructing a normal probability plot of 
the effect estimates. The table of plus and minus signs for the contrast constants for the 


Table 6-10 Pilot Plant Filtration Rate Experiment 


Run 

Number 



Factor 


Run Label 

Filtration 

Rate 

(gal/h) 

A 

B 

C 

D 

1 

- 

- 

- 

- 

(1) 

45 

2 

+ 

- 

- 

- 

a 

71 

3 

- 

+ 

- 

- 

b 

48 

4 

+ 

+ 

- 

- 

ab 

65 

5 

- 

- 

+ 

- 

c 

68 

6 

+ 

- 

+ 

- 

ac 

60 

7 

- 

+ 

+ 


be 

80 

8 

+ 

+ 

+ 


abc 

65 

9 

- 

- 

- 

t 

d 

43 

10 

+ 

- 

- 

+ 

ad 

100 

11 

- 

+ 

- 

+ 

bd 

45 

12 

+ 

+ 

- 

+ 

abd 

104 

13 

- 

- 

+ 

+ 

cd 

75 

14 

+ 

- 

+ 

+ 

acd 

86 

15 

- 

+ 

+ 

+ 

bed 

70 

16 

+ 

+ 

+ 

+ 

abed 

96 
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Figure 6-10 Data from the pilot plant filtration rate experiment for 
Example 6-2. 


2 4 design are shown in Table 6-11 (page 248). From these contrasts, we may estimate 
the 15 factorial effects and the sums of squares shown in Table 6-12 (page 249). 

The normal probability plot of these effects is shown in Figure 6-11 (page 249). All 
of the effects that lie along the line are negligible, whereas the large effects are far from 
the line. The important effects that emerge from this analysis are the main effects of A, 
C, and D and the AC and AD interactions. 

The main effects of A, C, and D are plotted in Figure 6-12 a (page 250). All three 
effects are positive, and if we considered only these main effects, we would run all three 
factors at the high level to maximize the filtration rate. However, it is always necessary 
to examine any interactions that are important. Remember that main effects do not have 
much meaning when they are involved in significant interactions. 

The AC and AD interactions are plotted in Figure 6-12 b. These interactions are the 
key to solving the problem. Note from the AC interaction that the temperature effect is 
very small when the concentration is at the high level and very large when the concen¬ 
tration is at the low level, with the best results obtained with low concentration and high 
temperature. The AD interaction indicates that stirring rate D has little effect at low 
temperature but a large positive effect at high temperature. Therefore, the best filtration 
rates would appear to be obtained when A and D are at the high level and C is at the low 
level. This would allow the reduction of the formaldehyde concentration to a lower level, 
another objective of the experimenter. 


Design Projection 

Another interpretation of the effects in Figure 6-11 is possible. Because B (pressure) is 
not significant and all interactions involving B are negligible, we may discard B from 
the experiment so that the design becomes a 2 3 factorial in A, C, and D with two repli¬ 
cates. This is easily seen from examining only columns A, C, and D in the design matrix 
shown in Table 6-10 and noting that those columns form two replicates of a 2 3 design. 
The analysis of variance for the data using this simplifying assumption is summarized 
in Table 6-13 (page 250). The conclusions that we would draw from this analysis are 
essentially unchanged from those of Example 6-2. Note that by projecting the single 
replicate of the 2 4 into a replicated 2 3 , we now have both an estimate of the ACD inter¬ 
action and an estimate of error based on what is sometimes called hidden replication. 
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Table 6-12 Factor Effect Estimates and Sums of Squares for the 2 4 
Factorial in Example 6-2 


Model 

Term 

Effect 

Estimate 

Sum of 
Squares 

Contribution 

A 

21.625 

1870.56 

32.6397 

B 

3.125 

39.0625 

0.681608 

C 

9.875 

390.062 

6.80626 

D 

14.625 

855.563 

14.9288 

AB 

0.125 

0.0625 

0.00109057 

AC 

-18.125 

1314.06 

22.9293 

AD 

16.625 

1105.56 

19.2911 

BC 

2.375 

22.5625 

0.393696 

BD 

-0.375 

0.5625 

0.00981515 

CD 

-1.125 

5.0625 

0.0883363 

ABC 

1.875 

14.0625 

0.245379 

ABD 

4.125 

68.0625 

1.18763 

ACD 

-1.625 

10.5625 

0.184307 

BCD 

-2.625 

27.5625 

0.480942 

ABCD 

1.375 

7.5625 

0.131959 



-18.12 -8.19 1.75 11.69 21.62 


Effect 

Figure 6-11 Normal probability plot of the effects for the 2 4 factorial in Example 6-2. 
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(6) Interaction plots 

Figure 6-12 Main effect and interaction plots for Example 6-2. 


Table 6-13 Analysis of Variance for the Pilot Plant Filtration Rate Experiment in A, C, and D 


Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

F 0 

P-Value 

A 

1870.56 

1 

1870.56 

83.36 

<0.0001 

C 

390.06 

1 

390.06 

17.38 

<0.0001 

D 

855.56 

1 

855.56 

38.13 

<0.0001 

AC 

1314.06 

1 

1314.06 

58.56 

<0.0001 

AD 

1105.56 

1 

1105.56 

49.27 

<0.0001 

CD 

5.06 

1 

5.06 

<1 


ACD 

10.56 

1 

10.56 

<1 


Error 

179.52 

8 

22.44 



Total 

5730.94 

15 
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The concept of projecting an unreplicated factorial into a replicated factorial in fewer 
factors is very useful. In general, if we have a single replicate of 2 k design, and if 
h(h<k ) factors are negligible and can be dropped, then the original data correspond to 
a full two-level factorial in the remaining k — h factors with 2 h replicates. 

Diagnostic Checking 

The usual diagnostic checks should be applied to the residuals of a 2 k design. Our analysis 
indicates that the only significant effects are A = 21.625, C = 9.875, D = 14.625, 
AC = -18.125, and AD = 16.625. If this is true, the estimated filtration rates are given 
by 



where 70.06 is the average response and the coded variables x u jc 3 , x 4 take on values 
between -1 and +1. The predicted filtration rate at run (1) is 



= 46.22 

Because the observed value is 45, the residual is e = y — y = 45 — 46.22 = —1.22. 
The values of y, y, and e = y - y for all 16 observations follow. 



y 

y 

e = y~$ 

(1) 

45 

46.22 

-1.22 

a 

71 

69.39 

1.61 

b 

48 

46.22 

1.78 

ab 

65 

69.39 

-4.39 

c 

68 

74.23 

-6.23 

ac 

60 

61.14 

-1.14 

be 

80 

74.23 

5.77 

abc 

65 

61.14 

3.86 

d 

43 

44.22 

-1.22 

ad 

100 

100.65 

-0.65 

bd 

45 

44.22 

0.78 

abd 

104 

100.65 

3.35 

cd 

75 

72.23 

2.77 

acd 

86 

92.40 

-6.40 

bed 

70 

72.23 

-2.23 

abed 

96 

92.40 

3.60 


A normal probability plot of the residuals is shown in Figure 6-13 on the next page. The 
points on this plot lie reasonably close to a straight line, lending support to our conclusion 
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-6.375 -3.34375 -0.3125 2.71875 5.75 

Residual 

Figure 6-13 Normal probability plot of residuals for Example 6-2. 


that A, C, D, AC, and AD are the only significant effects and that the underlying 
sumptions of the analysis are satisfied. 


The Response Surface 

We used the interaction plots in Figure 6-12 to provide a practical interpretation of the 
results of this experiment. Sometimes we find it helpful to use the response surface for 
this purpose. The response surface is generated by the regression model 


y = 70.06 + 


m 




18.125\ 
, 2 ) 


16.625\ 

— 2 — / X>X * 


Figure 6-14a shows the response surface contour plot when stirring rate is at the high 
level (i.e., x 4 = 1). The contours are generated from the above model with x 4 = 1, or 


Notice that the contours are curved lines because the model contains an interaction term. 

Figure 6-14 b is the response surface contour plot when temperature is at the high 
level (i.e., x, = 1). When we put x, = 1 in the regression model we obtain 



y = 80.8725 


31.25 
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Figure 6-14 Contour plots of filtration rate, Example 6-2. 


These contours are parallel straight lines because the model contains only the main effects 
of factors C (x 3 ) and D (x 4 ). 

Both contour plots indicate that if we want to maximize the filtration rate, variables 
A (jci) and D (x 4 ) should be at the high level and that the process is relatively robust to 
concentration C. We obtained similar conclusions from the interaction graphs. 

The Half-Normal Plot of Effects 

An alternative to the normal probability plot of the factor effects is the half-normal plot. 
This is a plot of the absolute value of the effect estimates against their cumulative normal 
probabilities. Figure 6-15 (page 254) presents the half-normal plot of the effects for 
Example 6-2. The straight line on the half-normal plot always passes through the origin 
and should also pass close to the fiftieth percentile data value. Many analysts feel that 
the half-normal plot is easier to interpret, particularly when there are only a few effect 
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Effect 

Figure 6-15 Half-normal plot of the factor effects from Example 6-2. 


estimates such as when the experimenter has used an eight-run design. Some software 
packages will construct both plots. 


Other Methods for Analyzing Unreplicated Factorials 

The standard analysis procedure for an unreplicated two-level factorial design is the normal 
(or half-normal) plot of the estimated factor effects. However, unreplicated designs are so 
widely used in practice that many formal analysis procedures have been proposed to over¬ 
come the subjectivity of the normal probability plot. Hamada and Balakrishnan (1998) 
compared some of these methods. They found that the method proposed by Lenth (1989) 
has good power to detect significant effects. It is also easy to implement, and as a result, 
it is beginning to appear in some software packages for analyzing data from unreplicated 
factorials. We give a brief description of Lenth’s method. 

Suppose that we have m contrasts of interest, say c l5 c 2 , ■ ■ ■, c m . If the design is an 
unreplicated 2 k factorial design, these contrasts correspond to the m = 2 k - 1 factor 
effect estimates. The basis of Lenth’s method is to estimate the variance of a contrast 
from the smallest (in absolute value) contrast estimates. Let 

s 0 = 1.5 X median( I Cj I) 


and 


PSE = 1.5 X median!|c y | :|c,-| < 2.5%) 
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PSE is called the “pseudo standard error,” and Lenth shows that it is a reasonable 
estimator of the contrast variance when there are not many active (significant) effects. 
The PSE is used to judge the significance of contrasts. An individual contrast can be 
compared to the margin of error 


ME — too25,d X PSE 

where the degrees of freedom are defined as d = m/3. For inference on a group of 
contrasts Lenth suggests using the simultaneous margin of error 

SME = t yM X PSE 

where the percentage point of the t distribution used is y = 1 - (1 + 0.95 1/m )/2. 

To illustrate Lenth’s method, consider the 2 4 experiment in Example 6-2. The cal¬ 
culations result in s 0 = 1.5 X | -2.6251 = 3.9375 and 2.5 X 3.9375 = 9.84375, so 

PSE = 1.5 X 11.751 = 2.625 
ME = 2.571 X 2.625 = 6.75 
SME = 5.219 X 2.625 = 13.70 

Now consider the effect estimates in Table 6-12. The SME criterion would indicate that 
the four largest effects (in magnitude) are significant, because their effect estimates ex¬ 
ceed SME. The main effect of C is significant according to the ME criterion, but not with 
respect to SME. However, because the AC interaction is clearly important, we would 
probably include C in the list of significant effects. Notice that in this example, Lenth’s 
method has produced the same answer that we obtained previously from examination of 
the normal probability plot of effects. 

Several authors [see Hamada and Balakrishnan (1998), Loughin (1998), Loughin 
and Noble (1997), and Lamtz and Whitcomb (1998)] have observed that Lenth’s method 
fails to control type I error rates, and that simulation methods can be used to calibrate 
his procedure. Lamtz and Whitcomb (1998) suggest replacing the original ME and SME 
multipliers with adjusted multipliers as follows: 


Number of Contrasts 

7 

15 

31 

Original ME 

3.764 

2.571 

2.218 

Adjusted ME 

2.295 

2.140 

2.082 

Original SME 

9.008 

5.219 

4.218 

Adjusted SME 

4.891 

4.163 

4.030 


These are in close agreement with the results in Ye and Hamada (2000). 

In general, the Lenth method is a clever and useful procedure. However, we rec¬ 
ommend using it as a supplement to the usual normal probability plot of effects, not as 
a replacement for it. 

Bisgaard (1998-1999) has provided a nice graphical technique, called a conditional 
inference chart, to assist in interpreting the normal probability plot. The purpose of the 
graph is to help the experimenter in judging significant effects. This would be relatively 
easy if the standard deviation a were known, or if it could be estimated from the data. 
In unreplicated designs, there is no internal estimate of tr, so the conditional inference 
chart is designed to help the experimenter evaluate effect magnitude for a range of 
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standard deviation values. Bisgaard bases the graph on the result that the standard error 
of an effect in a two-level design with N runs (for an unreplicated factorial, N = 2 k ) is 

2a 

Vn 

where a is the standard deviation of an individual observation. Then ±2 times the stan¬ 
dard error of an effect is 

4 a 

~Vn 

Once the effects are estimated, plot a graph as shown in Figure 6-16, with the effect 
estimates plotted along the vertical, or y-axis. In this figure, we have used the effect 
estimates from Example 6-2.The horizontal, or x-axis, of Figure 6-16 is a standard de¬ 
viation (<t) scale. The two lines are at 

4cr 4 a 

v = -I—and v =- 

y Vn Vn 

In our example, N = 16, so the lines are at y = +a and y = -a. Thus, for any given 
value of the standard deviation a we can read off the distance between these two lines 
as an approximate 95 percent confidence interval on the negligible effects. 

In Figure 6-16, we observe that if the experimenter thinks that the standard deviation 
is between 4 and 8, then factors A, C, D, and the AC and AD interactions are significant. 
If he or she thinks that the standard deviation is as large as 10, factor C may not be 
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Figure 6-17 Data from the drilling experiment of Example 6-3. 


significant. That is, for any given assumption about the magnitude of er, the experimenter 
can construct a “yardstick” forjudging the approximate significance of effects. The 
chart can also be used in reverse. For example, suppose that we were uncertain about 
whether factor C is significant. The experimenter could then ask whether it is reasonable 
to expect that a could be as large as 10 or more. If it is unlikely that a is as large as 10, 
then we can conclude that C is significant. 

We now present three instructive examples of unreplicated 2 k factorial designs. 

EXAMPLE 6-3 . 

Data Transformation in a Factorial Design 

Daniel (1976) describes a 2 4 factorial design used to study the advance rate of a drill as 
a function of four factors: drill load (A), flow rate (B), rotational speed (C), and the type 
of drilling mud used (D). The data from the experiment are shown in Figure 6-17. 

The normal probability plot of the effect estimates from this experiment is shown in 
Figure 6-18. Based on this plot, factors B, C, and D along with the BC and BD interactions 



0 1 2 3 4 5 6 7 

Effect estimate 

Figure 6-18 Normal probability plot of effects for Example 6-3. 
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- 2-10 1 2 


Residuals 

Figure 6-19 Normal probability plot of residuals for Example 6-3. 


require interpretation. Figure 6-19 is the normal probability plot of the residuals and 
Figure 6-20 is the plot of the residuals versus the predicted advance rate from the model 
containing the identified factors. There are clearly problems with normality and equality 
of variance. A data transformation is often used to deal with such problems. Because the 
response variable is a rate, the log transformation seems a reasonable candidate. 

Figure 6-21 presents a normal probability plot of the effect estimates following the 



Predicted advance rate 

Figure 6-20 Plot of residuals versus predicted rate for Example 6-3. 
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Effect estimate 


Figure 6-21 Normal probability plot of effects for Example 6-3 following log 
transformation. 


transformation y* = In y. Notice that a much simpler interpretation now seems possible, 
because only factors B, C, and D are active. That is, expressing the data in the correct 
metric has simplified its structure to the point that the two interactions are no longer 
required in the explanatory model. 

Figures 6-22 and 6-23 present, respectively, a normal probability plot of the residuals 



Residuals 


Figure 6-22 Normal probability plot of residuals for Example 6-3 following log 
transformation. 
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Predicted log advance rate 

Figure 6-23 Plot of residuals versus predicted rate for Example 6-3 fol¬ 
lowing log transformation. 


and a plot of the residuals versus the predicted advance rate for the model in the log 
scale containing B, C, and D. These plots are now satisfactory. We conclude that the 
model for y* = In y requires only factors B, C, and D for adequate interpretation. The 
analysis of variance for this model is summarized in Table 6-14. The model sum of 
squares is 

SS Mode i = SS B + SS C + SS D 

= 5.345 + 1.339 + 0.431 
= 7.115 

and R 2 = SS Mod JSS T = 7.115/7.288 = 0.98, so the model explains about 98 percent of 
the variability in the drill advance rate. 


EXAMPLE 6-4 . 

Location and Dispersion Effects in an Unreplicated Factorial 

A 2 4 design was run in a manufacturing process producing interior side-wall and window 
panels for commercial aircraft. The panels are formed in a press, and under present 


Table 6-14 Analysis of Variance for Example 6-3 Following 
the Log Transformation 


Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

F 0 

F-Value 

B (Flow) 

5.345 

1 

5.345 

381.79 

<0.0001 

C (Speed) 

1.339 

1 

1.339 

95.64 

<0.0001 

D (Mud) 

0.431 

1 

0.431 

30.79 

<0.0001 

Error 

0.173 

12 

0.014 



Total 

7.288 

15 
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Factors Low (-) 

A = Temperature (°F) 295 

B = Clamp time (min) 7 

C = Resin flow 10 

D = Closing time (s) 15 

D 



High (+) 
325 
9 

20 

30 



Figure 6-24 Data for the panel process experiment of Example 6-4. 


conditions the average number of defects per panel in a press load is much too high. 
(The current process average is 5.5 defects per panel.) Four factors are investigated using 
a single replicate of a 2 4 design, with each replicate corresponding to a single press load. 
The factors are temperature (A), clamp time (B), resin flow (C), and press closing time 
(. D ). The data for this experiment are shown in Figure 6-24. 

A normal probability plot of the factor effects is shown in Figure 6-25. Clearly the 
two largest effects are A = 5.75 and C = -4.25. No other factor effects appear to be 
large, and A and C explain about 77 percent of the total variability, so we conclude that 
lower temperature (A) and higher resin flow (C) would reduce the incidence of panel 
defects. 

Careful residual analysis is an important aspect of any experiment. A normal prob¬ 
ability plot of the residuals showed no anomalies, but when the experimenter plotted the 
residuals versus each of the factors A through D, the plot of residuals versus B (clamp 



Factor effects 


Figure 6-25 Normal probability plot of the factor effects for the panel process ex¬ 
periment of Example 6-4. 
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i 


V 

• B = Clamp time 


V 




Figure 6-26 Plot of residuals versus clamp time for Example 
6-4. 

time) presented the pattern in Figure 6-26. This factor, which is unimportant insofar as 
the average number of defects per panel is concerned, is very important in its effect on 
process variability, with the lower clamp time resulting in less variability in the average 
number of defects per panel in a press load. 

The dispersion effect of clamp time is also very evident from the cube plot in Figure 
6-27, which plots the average number of defects per panel and the range of the number 
of defects at each point in the cube defined by factors A, B, and C. The average range 
when B is at the high level (the back face of the cube in Figure 6-27) is R B + = 4.75, and 
when B is at the low level it is R B - = 1.25. 

As a result of this experiment, the engineer decided to run the process at low tem¬ 
perature and high resin flow to reduce the average number of defects, at low clamp time 
to reduce the variability in the number of defects per panel, and at low press closing time 
(which had no effect on either location or dispersion). The new set of operating conditions 
resulted in a new process average of less than one defect per panel. 


The residuals from a 2* design provide much information about the problem under 
study. Because residuals can be thought of as observed values of the noise or error, they 
often give insight into process variability. We can systematically examine the residuals 
from an unreplicated 2 k design to provide information about process variability. 



Figure 6-27 Cube plot of temperature, clamp time, and resin flow for Example 
6-4. 





pie 6-4 
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Consider the residual plot in Figure 6-26. The standard deviation of the eight resid¬ 
uals where B is at the low level is S(B) = 0.83, and the standard deviation of the eight 
residuals where B is at the high level is S(B + ) = 2.72. The statistic 


Ft = hi 


S 2 (B + ) 

S 2 (B) 


(6-24) 


has an approximate normal distribution if the two variances cr 2 (B + ) and a 2 (B ) are equal. 
To illustrate the calculations, the value of Ft is 


. S 2 (B + ) 
S 2 (B~) 
(2.72) 2 
(0.83) 2 
= 2.37 


Table 6-15 (on the previous page) presents the complete set of contrasts for the 2 4 
design along with the residuals for each run from the panel process experiment in Ex¬ 
ample 6-4. Each column in this table contains an equal number of plus and minus signs, 
and we can calculate the standard deviation of the residuals for each group of signs in 
each column, say S(i + ) and S(i~), i = 1,2,..., 15. Then 

Ff = In i = 1, 2,.... 15 (6-25) 

is a statistic that can be used to assess the magnitude of the dispersion effects in the 
experiment. If the variance of the residuals for the runs where factor i is positive equals 
the variance of the residuals for the runs where factor i is negative, then Ff has an 
approximate normal distribution. The values of Ff are shown below each column in 
Table 6-15. 

Figure 6-28 is a normal probability plot of the dispersion effects Ff. Clearly, B is 
an important factor with respect to process dispersion. For more discussion of this pro- 



F* 


Figure 6-28 Normal probability plot of the dispersion effects F* for Ex¬ 
ample 6-4. 




6-5 A SINGLE REPLICATE OF THE 2 k DESIGN 


265 


cedure, see Box and Meyer (1986) and Myers and Montgomery (1995). Also, in order 
for the model residuals to properly convey information about dispersion effects, the 
location model must be correctly specified. Refer to the supplemental text material for 
this chapter for more details and an example. 


EXAMPLE 6-5 . 

Duplicate Measurements on the Response 

A team of engineers at a semiconductor manufacturer ran a 2 4 factorial design in a vertical 
oxidation furnace. Four wafers are “stacked” in the furnace, and the response variable 
of interest is the oxide thickness on the wafers. The four design factors are temperature 
(A), time ( B ), pressure (C), and gas flow (D). The experiment is conducted by loading 
four wafers into the furnace, setting the process variables to the test conditions required 
by the experimental design, processing the wafers, and then measuring the oxide thick¬ 
ness on all four wafers. Table 6-16 presents the design and the resulting thickness mea¬ 
surements. In this table, the four columns labeled “Thickness” contain the oxide 
thickness measurements on each individual wafer, and the last two columns contain the 
sample average and sample variance of the thickness measurements on the four wafers 
in each run. 

The proper analysis of this experiment is to consider the individual wafer thickness 
measurements as duplicate measurements, and not as replicates. If they were really 
replicates, each wafer would have been processed individually on a single run of the 
furnace. However, because all four wafers were processed together, they received the 
treatment factors (that is, the levels of the design variables) simultaneously, so there is 
much less variability in the individual wafer thickness measurements than would have 
been observed if each wafer was a replicate. Therefore, the average of the thickness 
measurements is the correct response variable to initially consider. 

Table 6-17 (page 266) presents the effect estimates for this experiment, using the 
average oxide thickness y as the response variable. Note that factors A and B and the AB 
interaction have large effects that together account for nearly 90 percent of the variability 


Table 6-16 
Standard 
Order 


2 

3 


6 

7 

8 
9 

10 

11 

12 

13 

14 

15 

16 


The Oxide Thickness Experiment 
Run 

Order A B C D 


10 

7 

3 
9 
6 
2 
5 

4 
12 
16 

8 
1 

14 

15 
11 
13 


Thickness y s 2 


378 376 379 

415 416 416 

380 379 382 

450 446 449 

375 371 373 

391 390 388 

384 385 386 

426 433 430 

381 381 375 

416 420 412 

371 372 371 

445 448 443 

377 377 379 

391 391 386 

375 376 376 

430 430 428 


379 378 2 

417 416 0.67 

383 381 3.33 

447 448 3.33 

369 372 6.67 

391 390 2 

385 385 0.67 

431 430 8.67 

383 380 12.00 

412 415 14.67 

370 371 0.67 

448 446 6 

379 378 1.33 

400 392 34 

377 376 0.67 

428 429 1.33 
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Table 6-17 Effect Estimates for Example 6-5, Response Variable Is 


Average Oxide Thickness 


Model 

Term 

Effect 

Estimate 

Sum of 
Squares 

Percent 

Contribution 

A 

43.125 

7439.06 

67.9339 

B 

18.125 

1314.06 

12.0001 

C 

-10.375 

430.562 

3.93192 

D 

-1.625 

10.5625 

0.0964573 

AB 

16.875 

1139.06 

10.402 

AC 

-10.625 

451.563 

4.12369 

AD 

1.125 

5.0625 

0.046231 

BC 

3.875 

60.0625 

0.548494 

BD 

-3.875 

60.0625 

0.548494 

CD 

1.125 

5.0625 

0.046231 

ABC 

-0.375 

0.5625 

0.00513678 

ABD 

2.875 

33.0625 

0.301929 

ACD 

-0.125 

0.0625 

0.000570753 

BCD 

-0.625 

1.5625 

0.0142688 

ABCD 

0.125 

0.0625 

0.000570753 


in average oxide thickness. Figure 6-29 is a normal probability plot of the effects. From 
examination of this display, we would conclude that factors A, B, and C and the AB and 
AC interactions are important. The analysis of variance display for this model is shown 
in Table 6-18. 

The model for predicting average oxide thickness is 

5> = 399.19 + 21.56jc, + 9.06jc 2 - 5.19x 3 + 8.44x,x 2 - 5.31*!*;, 

The residual analysis of this model is satisfactory. 

The experimenters are interested in obtaining an average oxide thickness of 400 A, 
and product specifications require that the thickness must lie between 390 and 410 A. 
Figure 6-30 (page 268) presents two contour plots of average thickness, one with factor 
C (or x 3 ), pressure, at the low level (that is, x 3 = -1) and the other with C (or x 3 ) at the 
high level (that is, x 3 = +1). From examining these contour plots, it is obvious that there 
are many combinations of time and temperature (factors A and B) that will produce 
acceptable results. However, if pressure is held constant at the low level, the operating 
“window” is shifted toward the left, or lower, end of the time axis, indicating that lower 
cycle times will be required to achieve the desired oxide thickness. 

It is interesting to observe the results that would be obtained if we incorrectly 
consider the individual wafer oxide thickness measurements as replicates. Table 6-19 
(page 269) presents a full model analysis of variance based on treating the experiment 
as a replicated 2 4 factorial. Notice that there are many significant factors in this analysis, 
suggesting a much more complex model than we found when using the average oxide 
thickness as the response. The reason for this is that the estimate of the error variance in 
Table 6-19 is too small (a 2 = 6.12). The residual mean square in Table 6-19 reflects the 
variability between wafers within a run and variability between runs. The estimate of 
error obtained from Table 6-18 is much larger, & 2 = 17.61, and it is primarily a measure 
of the between-run variability. This is the best estimate of error to use in judging the 
significance of process variables that are changed from run to run. 

A logical question to ask is: What harm results from identifying too many factors as 
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Effect 

Figure 6-29 Normal probability plot of the effects for the average oxide thickness 
response, Example 6-5. 


Table 6-18 Analysis of Variance (From Design-Expert) for the Average Oxide Thickness 
Response, Example 6-5 



Sum of 

Mean 

F 


Source 

Squares 

DF Square 

Value 

Prob > F 

Model 

10774.31 

5 2154.86 

122.35 

<0.000 

A 

7439.06 

1 7439.06 

422.37 

<0.000 

B 

1314.06 

1 1314.06 

74.61 

<0.000 

C 

430.56 

1 430.56 

24.45 

0.0006 

AB 

1139.06 

1 1139.06 

64.67 

<0.000 

AC 

451.56 

1 451.56 

25.64 

0.0005 

Residual 

176.12 

10 17.61 



Cor Total 

10950.44 

15 



Std. Dev. 

4.20 

ff-Squared 

0.9839 


Mean 

399.19 

Adj. /?-Squared 

0.9759 


C.V. 

1.05 

Pred. ff-Squared 

0.9588 


PRESS 

450.88 

Adeq. Precision 

27.967 



Coefficient 

Standard 

95% Cl 

95% Cl 

Factor 

Estimate 

DF Error 

Low 

High 

Intercept 

399.19 

1 1.05 

396.85 

401.53 

A-Time 

21.56 

1 1.05 

19.22 

23.90 

B-Temp 

9.06 

1 1.05 

6.72 

11.40 

C-Pressure 

-5.19 

1 1.05 

-7.53 

-2.85 

AB 

8.44 

1 1.05 

6.10 

10.78 

AC 

-5.31 

1 1.05 

-7.65 

-2.97 
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Figure 6-30 Contour plots of average oxide thickness with pressure (x 3 ) 
held constant. 


important? as the incorrect analysis in Table 6-19 would certainly do. The answer is that 
trying to manipulate or optimize the unimportant factors would be a waste of resources, 
and it could result in adding unnecessary variability to other responses of interest. 

When there are duplicate measurements on the response, there is almost always 
useful information about some aspect of process variability contained in these observa- 
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Table 6-19 Analysis of Variance (From Design-Expert) of the Individual Wafer Oxide 
Thickness Response 


Source 

Sum of 
Squares 

DF 

Mean 

Square 

F 

Value 

Prob > F 

Model 

43801.75 

15 

2920.12 

476.75 

<0.0001 

A 

29756.25 

1 

29756.25 

4858.16 

<0.0001 

B 

5256.25 

1 

5256.25 

858.16 

<0.0001 

C 

1722.25 

1 

1722.25 

281.18 

<0.0001 

D 

42.25 

1 

42.25 

6.90 

0.0115 

AB 

4556.25 

1 

4556.25 

743.88 

<0.0001 

AC 

1806.25 

1 

1806.25 

294.90 

<0.0001 

AD 

20.25 

1 

20.25 

3.31 

0.0753 

BC 

240.25 

1 

240.25 

39.22 

<0.0001 

BD 

240.25 

1 

240.25 

39.22 

<0.0001 

CD 

20.25 

1 

20.25 

3.31 

0.0753 

ABD 

132.25 

1 

132.25 

21.59 

<0.0001 

ABC 

2.25 

1 

2.25 

0.37 

0.5473 

ACD 

0.25 

1 

0.25 

0.041 

0.8407 

BCD 

6.25 

1 

6.25 

1.02 

0.3175 

ABCD 

0.25 

1 

0.25 

0.041 

0.8407 

Residual 

294.00 

48 

6.12 



Lack of Fit 

0.000 

0 




Pure Error 

294.00 

48 

6.13 



Cor. Total 

44095.75 

63 





tions. For example, if the duplicate measurements are multiple tests by a gauge on the 
same experimental unit, then the duplicate measurements give some insight about gauge 
capability. If the duplicate measurements are made at different locations on an experi¬ 
mental unit, they may give some information about the uniformity of the response variable 
across that unit. In our example, because we have one observation on each of four 
experimental units that have undergone processing together, we have some information 
about the within-run variability in the process. This information is contained in the vari¬ 
ance of the oxide thickness measurements from the four wafers in each run. It would be 
of interest to determine if any of the process variables influence the within-run variability. 

Figure 6-31 (page 270) is a normal probability plot of the effect estimates obtained 
using ln(s 2 ) as the response. Recall from Chapter 3 that we indicated that the log trans¬ 
formation is generally appropriate for modeling variability. There are not any strong 
individual effects, but factor A and the BD interaction are the largest. If we also include 
the main effects of B and D to obtain a hierarchical model, then the model for In (s 2 ) is 

frCs 2 ) = 1.08 + 0.41.x, - 0.40x 2 + 0.20*4 - 0.56* 2 * 4 

The model accounts for just slightly less than half of the variability in the In (s 2 ) response, 
which is certainly not spectacular as empirical models go, but it is often difficult to obtain 
exceptionally good models of variances. 

Figure 6-32 (page 270) is a contour plot of the predicted variance (not the log of the 
predicted variance) with pressure * 3 at the low level (recall that this minimizes cycle 
time) and gas flow * 4 at the high level. T hi s choice of gas flow gives the lowest values 
of predicted variance in the region of the contour plot. 

The experimenters here were interested in selecting values of the design variables 
that gave a mean oxide thickness within the process specifications and as close to 400 
A as possible, while simultaneously making the within-run variability small, say s 2 2. 
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Effect 

Figure 6-31 Normal probability plot of the effects using in (s 2 ) 
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-1.00 -0.50 0.00 0.50 1.00 

Figure 6-33 Overlay of the average oxide thickness and s 2 responses with 
pressure at the low level and gas flow at the high level. 


One possible way to find a suitable set of conditions is to overlay the contour plots in 
Figures 6-30 and 6-32. The overlay plot is shown in Figure 6-33, with the specifications 
on mean oxide thickness and the constraint s 2 =£ 2 shown as contours. In this plot, 
pressure is held constant at the low level and gas flow is held constant at the high level. 
The open region near the upper left center of the graph identifies a feasible region for 
the variables time and temperature. 

This is a simple example of using contour plots to study two responses simulta¬ 
neously. We will discuss this problem in more detail in Chapter 11. 


6-6 THE ADDITION OF CENTER POINTS TO THE 2 k DESIGN 

A potential concern in the use of two-level factorial designs is the assumption of linearity 
in the factor effects. Of course, perfect linearity is unnecessary, and the 2* system will 
work quite well even when the linearity assumption holds only very approximately. In 
fact, we have noted that if interaction terms are added to a main effects or first-order 
model, resulting in 

y = /3 0 + 2 PjXj + 22 PijXiXj + e (6-26) 

then we have a model capable of representing some curvature in the response function. 
This curvature, of course, results from the twisting of the plane induced by the interaction 
terms (i.jX.Xj. 
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There are going to be situations where the curvature in the response function is not 
adequately modeled by Equation 6-26. In such cases, a logical model to consider is 

y = 0 O + 2 fijXj + 22 fax* + 2 M + e (6-27) 


where the /3„ represent pure second-order or quadratic effects. Equation 6-27 is called 

a second-order response surface model. 

In running a two-level factorial experiment, we usually anticipate fitting the first- 
order model in Equation 6-26, but we should be alert to the possibility that the second- 
order model in Equation 6-27 is really more appropriate. There is a method of replicating 
certain points in a 2* factorial that will provide protection against curvature from second- 
order effects as well as allow an independent estimate of error to be obtained. The method 
consists of adding center points to the 2* design. These consist of n replicates run at the 
points x t = 0 (i = 1,2,..., k). One important reason for adding the replicate runs at the 
design center is that center points do not affect the usual effect estimates in a 2 k design. 
When we add center points, we assume that the k factors are quantitative. 

To illustrate the approach, consider a 2 2 design with one observation at each of the 
factorial points (-, -), (+, -), (-, +) and (+, +) and n c observations at the center 
point (0, 0). Figure 6-34 illustrates the situation. Let y F be the average of the four runs 
at the four factorial points and let y c be the average of the n c runs at the center point. If 
the difference y F - y c is small, then the center points lie on or near the plane passing 
through the factorial points, and there is no quadratic curvature. On the other hand, if 
y F ~ yc is large, then quadratic curvature is present. A single-degree-of-freedom sum 
of squares for pure quadratic curvature is given by 


■SSpure 


n F n c (y F - y c f 

n F + n c 


(6-28) 


where, in general, n F is the number of factorial design points. This quantity may be 
compared to the error mean square to test for pure quadratic curvature. More specifically, 
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when points are added to the center of the 2 k design, then the test for curvature (using 
Equation 6-28) actually tests the hypotheses 

Ho - E - = 0 

Hi ' % Pjj* 0 

Furthermore, if the factorial points in the design are unreplicated, one may use the n c 
center points to construct an estimate of error with n c — 1 degrees of freedom. 


EXAMPLE 6-6 . 

A chemical engineer is studying the yield of a process. There are two variables of interest, 
reaction time and reaction temperature. Because she is uncertain about the assumption 
of linearity over the region of exploration, the engineer decides to conduct a 2 2 design 
(with a single replicate of each factorial run) augmented with five center points. The 
design and the yield data are shown in Figure 6-35. 

Table 6-20 summarizes the analysis of variance for this experiment. The mean square 
error is calculated from the center points as follows: 


MS e = 


SS E 
n c ~ 1 


O', - y) 


Thus, in Table 6-20 (page 274), 

5 

E (yt - 40.46) 2 

MS * = ~ -4- 

= 0.1720 
4 


(6-29) 


= 0.0430 



30 35 40 

A = Reaction time (min) 
Figure 6-35 The 2 2 design with five center 
points for Example 6-6. 
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Table 6-20 Analysis of Variance for Example 6-6 


Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

F 0 

P-Value 

A (Time) 

2.4025 

1 

2.4025 

55.87 

0.0017 

B (Temperature) 

0.4225 

1 

0.4225 

9.83 

0.0350 

AB 

0.0025 

1 

0.0025 

0.06 

0.8185 

Pure quadratic 

0.0027 

1 

0.0027 

0.06 

0.8185 

Error 

0.1720 

4 

0.0430 



Total 

3.0022 

8 





The average of the points in the factorial portion of the design is y F = 40.425, and the 
average of the points at the center is y c = 40.46. The difference y F - y c = 40.425 - 
40.46 = -0.035 appears to be small. The pure quadratic curvature sum of squares in the 
analysis of variance table is computed from Equation 6-28 as follows: 

ss = n F n c (y F ~ y c f 

Pure quadratic + n<: 

(4)(5)(—0.035) 2 
4 + 5 
- 0.0027 

The analysis of variance indicates that both factors exhibit significant main effects, that 
there is no interaction, and that there is no evidence of second-order curvature in the 
response over the region of exploration. That is, the null hypothesis H 0 : 0 n + j8 22 = 0 
cannot be rejected. 


In Example 6-6, we concluded that there was no indication of quadratic effects; that 
is, a first-order model 

y = 00 + 01*1 + 02*2 + 012*1*2 + € 

is appropriate (although we probably don’t need the interaction term). There will be 
situations where the quadratic terms will be required. That is, we will now have to assume 
a second-order model such as 

y = 00 + 0i*i + 02*2 + 012 * 1*2 + 011*1 + 022*2 + e 
Unfortunately, we cannot estimate the unknown parameters (the 0’s) in this model be¬ 
cause there are six parameters to estimate and the 2 2 design plus center points in Figure 
6-35 only has five independent runs. 

A simple and highly effective solution to this problem is to augment the 2 2 design 
with four axial runs, as shown in Figure 6-36 a. The resulting design, called a central 
composite design, can now be used to fit the second-order model. Figure 6-36 b shows 
a central composite design for k = 3 factors. This design has 14 + n c runs (usually 
3 « c 5), and is a very efficient design for fitting the 10-parameter second-order 
model in k = 3 factors. 

Central composite designs are used extensively in building second-order response 
surface models. These designs will be discussed in more detail in Chapter 11. 
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Figure 6-36 Central composite designs. 


We conclude this section with a few additional useful suggestions and observations 
concerning the use of center points. 

1. When a factorial experiment is conducted in an ongoing process, consider using 
the current operating conditions (or recipe) as the center point in the design. 
This often assures the operating personnel that at least some of the runs in the 
experiment are going to be performed under familiar conditions, and so the 
results obtained (at least for these runs) are unlikely to be any worse than are 
typically obtained. 

2. When the center point in a factorial experiment correspond to the usual operating 
recipe, the experimenter can use the observed responses at the center point to 
provide a rough check of whether anything “unusual” occurred during the 
experiment. That is, the center point responses should be very similar to responses 
observed historically in routine process operation. Often operating personnel will 
maintain a control chart for monitoring process performance. Sometimes the 
center point responses can be plotted directly on the control chart as a check of 
the manner in which the process was operating during the experiment. 

3. Consider running the replicates at the center point in nonrandom order. 
Specifically, run one or two center points at or near the beginning of the 
experiment, one or two near the middle, and one or two near the end. By 
spreading the center points out in time, the experimenter has a rough check on 
the stability of the process during the experiment. For example, if a trend has 
occurred in the response while the experiment was performed, plotting the center 
point responses versus time order may reveal this. 

4. Sometimes experiments must be conducted in situations where there is little or 
no prior information about process variability. In these cases, running two or 
three center points as the first few runs in the experiment can be very helpful. 
These runs can provide a preliminary estimate of variability. If the magnitude 
of the variability seems reasonable, continue; on the other hand, if larger than 
anticipated (or reasonable!) variability is observed, stop. Often it will be very 
profitable to study the question of why the variability is so large before 
proceeding with the rest of the experiment. 

5. Usually, center points are employed when all design factors are quantitative. 
However, sometimes there will be one or more qualitative or categorical variables 
and several quantitative ones. Center points can still be employed in these cases. 
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center points. 


To illustrate, consider an experiment with two quantitative factors, time and 
temperature, each at two levels, and a single qualitative factor, catalyst type, also 
with two levels (organic and nonorganic). Figure 6-37 shows the 2 3 design for 
these factors. Notice that the center points are placed in the opposed faces of the 
cube which involve the quantitative factors. In other words, the center points can 
be run at the high and low level treatment combinations of the qualitative factors, 
just so long as those subspaces involves only quantitative factors. 

6-7 PROBLEMS --- 

6-1. An engineer is interested in the effects of cutting speed (A), tool geometry (B), and cutting 
angle (C) on the life (in hours) of a machine tool. Two levels of each factor are chosen, 
and three replicates of a 2 3 factorial design are run. The results follow: 


A 

B 

C 

Treatment 

Combination 


Replicate 


I 

II 

m 

- 

- 

- 

(1) 

22 

31 

25 

+ 

- 

- 

a 

32 

43 

29 

- 

+ 

- 

b 

35 

34 

50 

+ 

+ 

- 

ab 

55 

47 

46 

- 

- 

+ 

c 

44 

45 

38 

+ 

- 

+ 

ac 

40 

37 

36 

- 

+ 

+ 

be 

60 

50 

54 

+ 

+ 

+ 

abc 

39 

41 

47 


(a) Estimate the factor effects. Which effects appear to be large? 

(b) Use the analysis of variance to confirm your conclusions for part (a). 

(c) Write down a regression model for predicting tool life (in hours) based on the results 
of this experiment. 

(d) Analyze the residuals. Are there any obvious problems? 

(e) Based on an analysis of main effect and interaction plots, what levels of A, B, and 
C would you recommend using? 

6-2. Reconsider part (c) of Problem 6-1. Use the regression model to generate response surface 
and contour plots of the tool life response. Interpret these plots. Do they provide insight 
regarding the desirable operating conditions for this process? 

6-3. Find the standard error of the factor effects and approximate 95 percent confidence limits 
for the factor effects in Problem 6-1. Do the results of this analysis agree with the con¬ 
clusions from the analysis of variance? 
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6-4. Plot the factor effects from Problem 6-1 on a graph relative to an appropriately scaled t 
distribution. Does this graphical display adequately identify the important factors? Com¬ 
pare the conclusions from this plot with the results from the analysis of variance. 

6-5. A router is used to cut locating notches on a printed circuit board. The vibration level at 
the surface of the board as it is cut is considered to be a major source of dimensional 
variation in the notches. Two factors are thought to influence vibration: bit size (A) and 
cutting speed (B). Two bit sizes and § inch) and two speeds (40 and 90 rpm) are 
selected, and four boards are cut at each set of conditions shown below. The response 
variable is vibration measured as the resultant vector of three accelerometers (x, y, and z) 
on each test circuit board. 


A 

B 

Treatment 

Combination 


Replicate 


I 

II 

III 

IV 

- 

- 

(1) 

18.2 

18.9 

12.9 

14.4 

+ 

- 

a 

27.2 

24.0 

22.4 

22.5 

- 

+ 

b 

15.9 

14.5 

15.1 

14.2 

+ 

+ 

ab 

41.0 

43.9 

36.3 

39.9 


(a) Analyze the data from this experiment. 

(b) Construct a normal probability plot of the residuals, and plot the residuals versus the 
predicted vibration level. Interpret these plots. 

(c) Draw the AB interaction plot. Interpret this plot. What levels of bit size and speed 
would you recommend for routine operation? 

6-6. Reconsider the experiment described in Problem 6-1. Suppose that the experimenter only 
performed the eight trials from replicate I. In addition, he ran four center points and 
obtained the following response values: 36, 40,43, 45. 

(a) Estimate the factor effects. Which effects are large? 

(b) Perform an analysis of variance, including a check for pure quadratic curvature. What 
are your conclusions? 

(c) Write down an appropriate model for predicting tool life, based on the results of this 
experiment. Does this model differ in any substantial way from the model in Problem 
6-1, part (c)? 

(d) Analyze the residuals. 

(e) What conclusions would you draw about the appropriate operating conditions for 
this process? 

6-7. An experiment was performed to improve the yield of a chemical process. Four factors 
were selected, and two replicates of a completely randomized experiment were run. The 
results are shown in the following table: 


Treatment 

Combination 

( 1 ) 

b 

ab 


be 


Replicate 

_I_n_ 

90 93 

74 78 

81 85 

83 80 

77 78 

81 80 

88 82 

73 70 


Treatment 

Combination 

d 

ad 

bd 

abd 

cd 

acd 

bed 

abed 


Replicate 
I II 

98 95 

72 76 

87 83 

85 86 

99 90 

79 75 

87 84 

80 80 
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(a) Estimate the factor effects. 

(b) Prepare an analysis of variance table, and determine which factors are important in 
explaining yield. 

(c) Write down a regression model for predicting yield, assuming that all four factors 
were varied over the range from — 1 to +1 (in coded units). 

(d) Plot the residuals versus the predicted yield and on a normal probability scale. Does 
the residual analysis appear satisfactory? 

(e) Two three-factor interactions, ABC and ABD, apparently have large effects. Draw a 
cube plot in the factors A, B, and C with the average yields shown at each comer. 
Repeat using the factors A, B, and D. Do these two plots aid in data interpretation? 
Where would you recommend that the process be run with respect to the four 
variables? 

6-8. A bacteriologist is interested in the effects of two different culture media and two different 

times on the growth of a particular vims. She performs six replicates of a 2 2 design, making 
the runs in random order. Analyze the bacterial growth data that follow and draw appro¬ 
priate conclusions. Analyze the residuals and comment on the model’s adequacy. 


Time, h 


Culture Medium 



1 

2 



21 

22 

25 

26 

12 

23 

28 

24 

25 


20 

26 

29 

27 


37 

39 

31 

34 

18 

38 

38 

29 

33 


35 

36 

30 

35 


6-9. An industrial engineer employed by a beverage bottler is interested in the effects of two 
different types of 32-ounce bottles on the time to deliver 12-bottle cases of the product. 
The two bottle types are glass and plastic. Two workers are used to perform a task con¬ 
sisting of moving 40 cases of the product 50 feet on a standard type of hand truck and 
stacking the cases in a display. Four replicates of a 2 2 factorial design are performed, and 
the times observed are listed in the following table. Analyze the data and draw appropriate 
conclusions. Analyze the residuals and comment on the model’s adequacy. 


Bottle Type 



Worker 


: 



2 

Glass 

5.12 

4.89 

6.65 

6.24 


4.98 

5.00 

5.49 

5.55 

Plastic 

4.95 

4.43 

5.28 

4.91 


4.27 

4.25 

4.75 

4.71 


In Problem 6-9, the engineer was also interested in potential fatigue differences resulting 
from the two types of bottles. As a measure of the amount of effort required, he measured 
the elevation of the heart rate (pulse) induced by the task. The results follow. Analyze the 
data and draw conclusions. Analyze the residuals and comment on the model’s adequacy. 


6 - 10 . 
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Bottle Type 


Worker 


: 

1 

2 


Glass 

39 

45 

20 

13 


58 

35 

16 

11 

Plastic 

44 

35 

13 

10 


42 

21 

16 

15 


6-11. Calculate approximate 95 percent confidence limits for the factor effects in Problem 6- 
10. Do the results of this analysis agree with the analysis of variance performed in Problem 
6 - 10 ? 

6-12. An article in the AT&T Technical Journal (March/April 1986, Vol. 65, pp. 39-50) de¬ 
scribes the application of two-level factorial designs to integrated circuit manufacturing. 
A basic processing step is to grow an epitaxial layer on polished silicon wafers. The wafers 
mounted on a susceptor are positioned inside a bell jar, and chemical vapors are intro¬ 
duced. The susceptor is rotated and heat is applied until the epitaxial layer is thick enough. 
An experiment was run using two factors: arsenic flow rate (A) and deposition time (B). 
Four replicates were run, and the epitaxial layer thickness was measured (in /urn). The 
data are shown below: 


A 

B 


Replicate 


Factor Levels 

I 

II 

III 

IV 

Low (-) 

High (+) 

- 

- 

14.037 

16.165 

13.972 

13.907 A 

55% 

59% 

+ 

- 

13.880 

13.860 

14.032 

13.914 



- 

+ 

14.821 

14.757 

14.843 

14.878 B 

Short 

Long 

+ 

+ 

14.888 

14.921 

14.415 

14.932 

(10 min) 

(15 min) 


(a) Estimate the factor effects. 

(b) Conduct an analysis of variance. Which factors are important? 

(c) Write down a regression equation that could be used to predict epitaxial layer thick¬ 
ness over the region of arsenic flow rate and deposition time used in this experiment. 

(d) Analyze the residuals. Are there any residuals that should cause concern? 

(e) Discuss how you might deal with the potential outlier found in part (d). 

6-13. Continuation of Problem 6-12. Use the regression model in part (c) of Problem 6-12 to 
generate a response surface contour plot for epitaxial layer thickness. Suppose it is criti¬ 
cally important to obtain layer thickness of 14.5 jam. What settings of arsenic flow rate 
and decomposition time would you recommend? 

6-14. Continuation of Problem 6-13. How would your answer to Problem 6-13 change if arsenic 

flow rate was more difficult to control in the process than the deposition time? 

6-15. A nickel-titanium alloy is used to make components for jet turbine aircraft engines. Crack¬ 

ing is a potentially serious problem in the final part, because it can lead to nonrecoverable 
failure. A test is run at the parts producer to determine the effect of four factors on cracks. 
The four factors are pouring temperature (A), titanium content (B), heat treatment method 
(C), and amount of grain refiner used (D). Two replicates of a 2 4 design are run, and the 
length of crack (in mm X 10 2 ) induced in a sample coupon subjected to a standard test 
is measured. The data are shown in the following table: 
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A 

B 

C 

D 

Treatment 

Combination 

Replicate 

I 

II 

_ 

- 

- 

- 

(1) 

7.037 

6.376 

+ 

- 

- 

- 

a 

14.707 

15.219 

- 

+ 

- 

- 

b 

11.635 

12.089 

+ 

+ 

- 

- 

ab 

17.273 

17.815 

_ 

- 

+ 

- 

C 

10.403 

10.151 

+ 

_ 

+ 

- 

ac 

4.368 

4.098 

_ 

+ 

+ 

- 

be 

9.360 

9.253 

+ 

+ 

+ 

- 

abc 

13.440 

12.923 

_ 

_ 

- 

+ 

d 

8.561 

8.951 

+ 

_ 

- 

+ 

ad 

16.867 

17.052 

_ 

+ 

- 

+ 

bd 

13.876 

13.658 

+ 

+ 

- 

+ 

abd 

19.824 

19.639 

- 

- 

+ 

+ 

cd 

11.846 

12.337 

+ 

- 

+ 

+ 

acd 

6.125 

5.904 

- 

+ 

+ 

+ 

bed 

11.190 

10.935 

+ 

+ 

+ 

+ 

abed 

15.653 

15.053 


(a) Estimate the factor effects. Which factor effects appear to be large? 

(b) Conduct an analysis of variance. Do any of the factors affect cracking? Use a = 

0.05. 

(c) Write down a regression model that can be used to predict crack length as a function 
of the significant main effects and interactions you have identified in part (b). 

(d) Analyze the residuals from this experiment. 

(e) Is there an indication that any of the factors affect the variability in cracking? 

(f) What recommendations would you make regarding process operations? Use inter¬ 
action and/or main effect plots to assist in drawing conclusions. 

6-16. Continuation of Problem 6-15. One of the variables in the experiment described in Prob¬ 
lem 6-15, heat treatment method (C), is a categorical variable. Assume that the remaining 

factors are continuous. 

(a) Write two regression models for predicting crack length, one for each level of the 
heat treatment method variable. What differences, if any, do you notice in these two 
equations? 

(b) Generate appropriate response surface contour plots for the two regression models 
in part (a). 

(c) What set of conditions would you recommend for the factors A, B, and D if you use 
heat treatment method C = +? 

(d) Repeat part (c) assuming that you wish to use heat treatment method C = —. 

6-17. An experimenter has run a single replicate of a 2 4 design. The following effect estimates 

have been calculated: 


A = 76.95 

AB = -51.32 

ABC = -2.82 

B = -67.52 

AC = 11.69 

ABD = -6.50 

C = -7.84 

AD = 9.78 

ACD = 10.20 

D = -18.73 

BC = 20.78 

BCD = -7.98 


BD = 14.74 

ABCD = -6.25 


CD = 1.27 


(a) Construct a normal probability plot of these effects. 


(b) Identify a tentative model, based on the plot of the effects in part (a). 
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6-18. An article in Solid State Technology (“Orthogonal Design for Process Optimization and 
Its Application in Plasma Etching,” May 1987, pp. 127-132) describes the application 
of factorial designs in developing a nitride etch process on a single-wafer plasma etcher. 
The process uses C 2 F 6 as the reactant gas. Four factors are of interest: anode-cathode gap 
(A), pressure in the reactor chamber (B), C 2 F 6 gas flow (C), and power applied to the 
cathode (D). The response variable of interest is the etch rate for silicon nitride. A single 
replicate of a 2 4 design is run, and the data are shown below: 


Run 

Number 

Actual 

Run 

Order 

A 

B 

C 

D 

Etch 

Rate 

(A/min) 

Factor Levels 

Low (—) 

High (+) 

1 

13 

- 

- 

- 

- 

550 

A (cm) 0.80 

1.20 

2 

8 

+ 

- 

- 

- 

669 

B (mTorr) 450 

550 

3 

12 

- 

+ 

- 

- 

604 

C (SCCM) 125 

200 

4 

9 

+ 

+ 

- 

- 

650 

D (W) 275 

325 

5 

4 

- 

- 

+ 

- 

633 



6 

15 

+ 

- 

+ 

- 

642 



7 

16 

- 

+ 

+ 

- 

601 



8 

3 

+ 

+ 

+ 

- 

635 



9 

1 

- 

- 

- 

+ 

1037 



10 

14 

+ 

- 

- 

+ 

749 



11 

5 

- 

+ 

- 

+ 

1052 



12 

10 

+ 

+ 

- 

+ 

868 



13 

11 

- 

- 

+ 

+ 

1075 



14 

2 

+ 

- 

+ 

+ 

860 



15 

7 

- 

+ 

+ 

+ 

1063 



16 

6 

+ 

+ 

+ 

+ 

729 




(a) Estimate the factor effects. Consider a normal probability plot of the factor effects. 
Which effects appear large? 

(b) Conduct an analysis of variance to confirm your findings for part (a). 

(c) What is the regression model relating etch rate to the significant process variables? 

(d) Analyze the residuals from this experiment. Comment on the model’s adequacy. 

(e) If not all the factors are important, project the 2 4 design into a 2 k design with k < 4 
and conduct the analysis of variance. 

(f) Draw graphs to interpret any significant interactions. 

(g) Plot the residuals versus the actual run order. What problems might be revealed by 
this plot? 

6-19. Continuation of Problem 6-18. Consider the regression model obtained in part (c) of 
Problem 6-18. 

(a) Construct contour plots of the etch rate using this model. 

(b) Suppose that it was necessary to operate this process at an etch rate of 800 A/min. 
What settings of the process variables would you recommend? 

6-20. Consider the single replicate of the 2 4 design in Example 6-2. Suppose we had arbitrarily 
decided to analyze the data assuming that all three- and four-factor interactions were 
negligible. Conduct this analysis and compare your results with those obtained in the 
example. Do you think that it is a good idea to arbitrarily assume interactions to be 
negligible even if they are relatively high-order ones? 

6-21. An experiment was run in a semiconductor fabrication plant in an effort to increase yield. 
Five factors, each at two levels, were studied. The factors (and levels) were A = aperture 
setting (small, large), B = exposure time (20% below nominal, 20% above nominal). 
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C = development time (30 s, 45 s), D = mask dimension (small, large), and E = etch 
time (14.5 min, 15.5 min). The unreplicated 2 s design shown below was run. 

(1) = 7 d = 8 <? = 8 de = 6 

a = 9 ad = 10 ae= 12 ade = 10 

fo = 34 bd = 32 foe = 35 forfe = 30 

afo = 55 afod = 50 afoe = 52 abde = 53 

c = 16 cd = 18 ce = 15 tie = 15 

ac = 20 acd = 21 ace = 22 acde = 20 

be = 40 bed = 44 bee = 45 bede = 41 

abc = 60 abed = 61 afoce = 65 abede = 63 

(a) Construct a normal probability plot of the effect estimates. Which effects appear to 
be large? 

(b) Conduct an analysis of variance to confirm your findings for part (a). 

(c) Write down the regression model relating yield to the significant process variables. 

(d) Plot the residuals on normal probability paper. Is the plot satisfactory? 

(e) Plot the residuals versus the predicted yields and versus each of the five factors. 
Comment on the plots. 

(f) Interpret any significant interactions. 

(g) What are your recommendations regarding process operating conditions? 

(h) Project the 2 5 design in this problem into a 2* design in the important factors. Sketch 
the design and show the average and range of yields at each run. Does this sketch 
aid in interpreting the results of this experiment? 

6-22. Continuation of Problem 6-21. Suppose that the experimenter had run four center points 
in addition to the 32 trials in the original experiment. The yields obtained at the center 
point runs were 68, 74, 76, and 70. 

(a) Reanalyze the experiment, including a test for pure quadratic curvature. 

(b) Discuss what your next step would be. 

6-23. In a process development study on yield, four factors were studied, each at two levels: 
time (A), concentration (B), pressure (C), and temperature (D). A single replicate of a 2 4 
design was run, and the resulting data are shown in the following table: 


Run 

Number 

Actual 

Run 

Order 

A 

B 

C 

D 

Yield 

(lbs) 

Factor Levels 

Low (-) High (+) 

1 

5 

- 

- 

- 

- 

12 

A (h) 2.5 

3 

2 

9 

+ 

- 

- 

- 

18 

B (%) 14 

18 

3 

8 

- 

+ 

- 

- 

13 

C (psi) 60 

80 

4 

13 

+ 

+ 

- 

- 

16 

D (°C) 225 

250 

5 

3 

- 

- 

+ 

- 

17 



6 

7 

-I- 

- 

+ 

- 

15 



7 

14 

- 

+ 

+ 

- 

20 



8 

1 

+ 

+ 

+ 

- 

15 



9 

6 

- 

- 

- 

+ 

10 



10 

11 

+ 

- 

- 

+ 

25 



11 

2 

- 

+ 

- 

+ 

13 



12 

15 

+ 

+ 

- 

+ 

24 



13 

4 

- 

- 

+ 

+ 

19 



14 

16 

+ 

- 

+ 

+ 

21 



15 

10 

- 

+ 

+ 

+ 

17 



16 

12 

+ 

+ 

+ 

+ 

23 
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(a) Construct a normal probability plot of the effect estimates. Which factors appear to 
have large effects? 

(b) Conduct an analysis of variance using the normal probability plot in part (a) for 
guidance in forming an error term. What are your conclusions? 

(c) Write down a regression model relating yield to the important process variables. 

(d) Analyze the residuals from this experiment. Does your analysis indicate any potential 
problems? 

(e) Can this design be collapsed into a 2 3 design with two replicates? If so, sketch the 
design with the average and range of yield shown at each point in the cube. Interpret 
the results. 

6-24. Continuation of Problem 6-23. Use the regression model in part (c) of Problem 6-23 to 

generate a response surface contour plot of yield. Discuss the practical value of this re¬ 
sponse surface plot. 

6-25. The scrumptious brownie experiment. The author is an engineer by training and a firm 
believer in learning by doing. I have taught experimental design for many years to a wide 
variety of audiences and have always assigned the planning, conduct, and analysis of an 
actual experiment to the class participants. The participants seem to enjoy this practical 
experience and always learn a great deal from it. This problem uses the results of an 
experiment performed by Gretchen Krueger at Arizona State University. 

There are many different ways to bake brownies. The purpose of this experiment was 
to determine how the pan material, the brand of brownie mix, and the stirring method 
affect the scrumptiousness of brownies. The factor levels were 



Factor 

Low (—) 


High (+) 




A = pan material 

Glass 


Aluminum 




B = stirring method 

Spoon 


Mixer 





C = brand of mix 

Expensive 


Cheap 




The response variable was scrumptiousness, a subjective measure derived from a ques¬ 
tionnaire given to the subjects who sampled each batch of brownies. (The questionnaire 
dealt with such issues as taste, appearance, consistency, aroma, and so forth). An eight- 
person test panel sampled each batch and filled out the questionnaire. The design matrix 

and the response data are shown below: 









Test Panel Results 




Batch 

ABC 1 

2 3 

4 

5 

6 

7 

8 

1 

11 

9 10 

10 

11 

10 

8 

9 

2 

+ - - 15 

10 16 

14 

12 

9 

6 

15 

3 

- + - 9 

12 11 

11 

11 

11 

11 

12 

4 

+ + - 16 

17 15 

12 

13 

13 

11 

11 

5 

+ 10 

11 15 

8 

6 

8 

9 

14 

6 

+ - + 12 

13 14 

13 

9 

13 

14 

9 

7 

- + + 10 

12 13 

10 

7 

7 

17 

13 

8 

+ + + 15 

12 15 

13 

12 

12 

9 

14 


(a) Analyze the data from this experiment as if there were eight replicates of a 2 3 design. 
Comment on the results. 
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(b) Is the analysis in part (a) the correct approach? There are only eight batches; do we 
really have eight replicates of a 2 3 factorial design? 

(c) Analyze the average and standard deviation of the scrumptiousness ratings. Com¬ 
ment on the results. Is this analysis more appropriate than the one in part (a)? Why 
or why not? 

6-26. An experiment was conducted on a chemical process that produces a polymer. The four 
factors studied were temperature (A), catalyst concentration (B), time (C), and pressure 
(£>). Two responses, molecular weight and viscosity, were observed. The design matrix 
and response data are shown below: 


Run 

Number 

Actual 

Run 

Order 

A 

B 

c 

D 

Molecular 

Weight 

Viscosity 

Factor Levels 

Low (-) 

High (+) 

1 

18 

- 

- 

- 

- 

2400 

1400 

A (°C) 100 

120 

2 

9 

+ 

- 

- 

- 

2410 

1500 

B (%) 4 

8 

3 

13 

- 

+ 

- 

- 

2315 

1520 

C (min) 20 

30 

4 

8 

+ 

+ 

- 

- 

2510 

1630 

D (psi) 60 

75 

5 

3 

- 

- 

+ 

- 

2615 

1380 



6 

11 

+ 

- 

+ 

- 

2625 

1525 



7 

14 

- 

+ 

+ 

- 

2400 

1500 



8 

17 

+ 

+ 

+ 

- 

2750 

1620 



9 

6 

- 

- 

- 

+ 

2400 

1400 



10 

7 

+ 

- 

- 

+ 

2390 

1525 



11 

2 

- 

+ 

- 

+ 

2300 

1500 



12 

10 

+ 

+ 

- 

+ 

2520 

1500 



13 

4 

- 

- 

+ 

+ 

2625 

1420 



14 

19 

+ 

- 

+ 

+ 

2630 

1490 



15 

15 

- 

+ 

+ 

+ 

2500 

1500 



16 

20 

+ 

+ 

+ 

+ 

2710 

1600 



17 

1 

0 

0 

0 

0 

2515 

1500 



18 

5 

0 

0 

0 

0 

2500 

1460 



19 

16 

0 

0 

0 

0 

2400 

1525 



20 

12 

0 

0 

0 

0 

2475 

1500 




(a) Consider only the molecular weight response. Plot the effect estimates on a normal 
probability scale. What effects appear important? 

(b) Use an analysis of variance to confirm the results from part (a). Is there indication 
of curvature? 

(c) Write down a regression model to predict molecular weight as a function of the 
important variables. 

(d) Analyze the residuals and comment on model adequacy. 

(e) Repeat parts (a)-(d) using the viscosity response. 

6-27. Continuation of Problem 6-26. Use the regression models for molecular weight and vis¬ 
cosity to answer the following questions. 

(a) Construct a response surface contour plot for molecular weight. In what direction 
would you adjust the process variables to increase molecular weight? 

(b) Construct a response surface contour plot for viscosity. In what direction would you 
adjust the process variables to decrease viscosity? 
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(c) What operating conditions would you recommend if it was necessary to produce a 
product with molecular weight between 2400 and 2500, and the lowest possible 
viscosity? 

6-28. Consider the single replicate of the 2 4 design in Example 6-2. Suppose that we ran five 
points at the center (0, 0, 0, 0) and observed the following responses: 73, 75, 71, 69, and 
76. Test for curvature in this experiment. Interpret the results. 

6-29. A missing value in a 2 k factorial. It is not unusual to find that one of the observations in 
a 2 k design is missing due to faulty measuring equipment, a spoiled test, or some other 
reason. If the design is replicated n times (n > 1) some of the techniques discussed in 
Chapter 5 can be employed. However, for an unreplicated factorial (n = 1) some other 
method must be used. One logical approach is to estimate the missing value with a number 
that makes the highest-order interaction contrast zero. Apply this technique to the exper¬ 
iment in Example 6-2 assuming that run ab is missing. Compare the results with the results 
of Example 6-2. 

6-30. An engineer has performed an experiment to study the effect of four factors on the surface 
roughness of a machined part. The factors (and their levels) are A = tool angle (12, 15°), 
B = cutting fluid viscosity (300, 400), C = feed rate (10, 15 in/min), and D = cutting 
fluid cooler used (no, yes). The data from this experiment (with the factors coded to the 
usual —1, +1 levels) are shown below. 


Run 

A 

B 

C 

D 

Surface Roughness 

1 

- 

- 

- 

- 

0.00340 

2 

+ 

- 

- 

- 

0.00362 

3 

- 

+ 

- 

- 

0.00301 

4 

+ 

+ 

- 

- 

0.00182 

5 

- 

- 

+ 

- 

0.00280 

6 

+ 

- 

+ 

- 

0.00290 

7 

- 

+ 

+ 

- 

0.00252 

8 

+ 

+ 

+ 

- 

0.00160 

9 

- 

- 

- 

+ 

0.00336 

10 

+ 

- 

- 

+ 

0.00344 

11 

- 

+ 

- 

+ 

0.00308 

12 

+ 

+ 

- 

+ 

0.00184 

13 

- 

- 

+ 

+ 

0.00269 

14 

+ 

- 

+ 

+ 

0.00284 

15 

- 

+ 

+ 

+ 

0.00253 

16 

+ 

+ 

+ 

+ 

0.00163 


(a) Estimate the factor effects. Plot the effect estimates on a normal probability plot and 
select a tentative model. 

(b) Fit the model identified in part (a) and analyze the residuals. Is there any indication 
of model inadequacy? 

(c) Repeat the analysis from parts (a) and (b) using 1/y as the response variable. Is there 
an indication that the transformation has been useful? 

(d) Fit a model in terms of the coded variables that can be used to predict the surface 
roughness. Convert this prediction equation into a model in the natural variables. 

6-31. Resistivity on a silicon wafer is influenced by several factors. The results of a 2 4 factorial 
experiment performed during a critical processing step is shown in the following table: 
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Run 

A 

B 

C 

D 

Resistivity 

1 

- 

- 

- 

- 

1.92 

2 

+ 

- 

- 

- 

11.28 

3 

- 

+ 

- 

- 

1.09 

4 

+ 

+ 

- 

- 

5.75 

5 

- 

- 

+ 

- 

2.13 

6 

+ 

- 

+ 

- 

9.53 

7 

- 

+ 

+ 

- 

1.03 

8 

+ 

+ 

+ 

- 

5.35 

9 

- 

- 

- 

+ 

1.60 

10 

+ 

- 

- 

+ 

11.73 

11 

- 

+ 

- 

+ 

1.16 

12 

+ 

+ 

- 

+ 

4.68 

13 

- 

- 

+ 

+ 

2.16 

14 

+ 

- 

+ 

+ 

9.11 

15 

- 

+ 

+ 

+ 

1.07 

16 

+ 

+ 

+ 

+ 

5.30 


(a) Estimate the factor effects. Plot the effect estimates on a normal probability plot and 
select a tentative model. 

(b) Fit the model identified in part (a) and analyze the residuals. Is there any indication 
of model inadequacy? 

(c) Repeat the analysis from parts (a) and (b) using In (y) as the response variable. Is 
there an indication that the transformation has been useful? 

(d) Fit a model in terms of the coded variables that can be used to predict the resistivity. 

6-32. Continuation of Problem 6-31. Suppose that the experimenter had also run four center 

points along with the 16 runs in Problem 6-31. The resistivity measurements at the center 
points are 8.15, 7.63, 8.95, and 6.48. Analyze the experiment again incorporating the 
center points. What conclusions can you draw now? 

6-33. Often the fitted regression model from a 2* factorial design is used to make predictions 
at points of interest in the design space. 

(a) Find the variance of the predicted response y at a point x u x 2 . x k in the design 

space. Hint: Remember that the x’s are coded variables, and assume a 2 k design with 
an equal number of replicates n at each design point so that the variance of a re¬ 
gression coefficient /3 is o 2 l(n2 k ) and that the covariance between any pair of re¬ 
gression coefficients is zero. 

(b) Use the result in part (a) to find an equation for a 100(1 - a) percent confidence 
interval on the true mean response at the point x„ x 2 , ■ ■ ■, x k in design space. 

6-34. Hierarchial models. Several times we have used the hierarchy principle in selecting a 
model; that is, we have included nonsignificant lower-order terms in a model because they 
were factors involved in significant higher-order terms. Hierarchy is certainly not an ab¬ 
solute principle that must be followed in all cases. To illustrate, consider the model re¬ 
sulting from Problem 6-1, which required that a nonsignificant main effect be included to 
achieve hierarchy. Using the data from Problem 6-1, 

(a) Fit both the hierarchial and the nonhierarchial model. 

(b) Calculate the PRESS statistic, the adjusted R 2 , and the mean square error for both 
models. 

(c) Find a 95 percent confidence interval on the estimate of the mean response at a cube 
comer (x, = x 2 = x 3 = ±1). Hint: Use the results of Problem 6-33. 

(d) Based on the analyses you have conducted, which model do you prefer? 




Blocking 

and Confounding 
in the 2 k Factorial 
Design _ 


7-1 INTRODUCTION 

There are many situations in which it is impossible to perform all of the runs in a 2 k 
factorial experiment under homogeneous conditions. For example, a single batch of raw 
material might not be large enough to make all of the required runs. In other cases, it 
might be desirable to deliberately vary the experimental conditions to ensure that the 
treatments are equally effective (that is, are robust) across many situations that are likely 
to be encountered in practice. For example, a chemical engineer may run a pilot plant 
experiment with several batches of raw material because he knows that different raw 
material batches of different quality grades are likely to be used in the actual full-scale 
process. 

The design technique used in these situations is blocking. In this chapter, we will 
concentrate on some special techniques for blocking in the 2* factorial design. 


7-2 BLOCKING A REPLICATED 2 k FACTORIAL DESIGN 

Suppose that the 2 k factorial design has been replicated n times. This is identical to the 
situation discussed in Chapter 5, where we showed how to run a general factorial design 
in blocks. If there are n replicates, then each set of nonhomogeneous conditions defines 
a block, and each replicate is run in one of the blocks. The runs in each block (or replicate) 

Table 7-1 Chemical Process Experiment in Three Blocks 

Block 1 Block 2 Block 3 


(1) = 28 


(1) = 25 


(1) = 27 

a = 36 


a = 32 


a = 32 

b = 18 


b = 19 


b = 23 

ab = 31 


ab = 30 


ab = 29 


Block totals: B, = 113 B 2 = 106 B 3 = 111 
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Table 7-2 Analysis of Variance for the Chemical Process Experiment in Three Blocks 


Source of Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Square 

F 0 

P-Value 

Blocks 

6.50 

2 

3.25 



A (concentration) 

208.33 

1 

208.33 

50.32 

0.0004 

B (catalyst) 

75.00 

1 

75.00 

18.12 

0.0053 

AB 

8.33 

1 

8.33 

2.01 

0.2060 

Error 

24.84 

6 

4.14 



Total 

323.00 

11 





would be made in random order. The analysis of the design is similar to that of any 
blocked factorial experiment; for example, see the discussion in Section 5-6. 


EXAMPLE 7-1 . 

Consider the chemical process experiment first described in Section 6-2. Suppose that 
only four experimental trials can be made from a single batch of raw material. Therefore, 
three batches of raw material will be required to run all three replicates of this design. 
Table 7-1 (on the previous page) shows the design, where each batch of raw material 
corresponds to a block. 

The analysis of variance for this blocked design is shown in Table 7-2. All of the 
sums of squares are calculated exactly as in a standard, unblocked 2 k design. The sum 
of squares for blocks is calculated from the block totals. Let B u Bo, and B 3 represent the 
block totals (see Table 7-1). Then 


= 

cks h 4 12 

_ ( 113) 2 + ( 106) 2 + ( 111) 2 


( 330) 2 

12 


= 6.50 


There are two degrees of freedom among the three blocks. Table 7-2 indicates that the 
conclusions from this analysis, had the design been run in blocks, are identical to those 
in Section 6-2 and that the block effect is relatively small. 


7-3 CONFOUNDING IN THE 2 k FACTORIAL DESIGN 

There are many problems for which it is impossible to perform a complete replicate of 
a factorial design in one block. Confounding is a design technique for arranging a 
complete factorial experiment in blocks, where the block size is smaller than the number 
of treatment combinations in one replicate. The technique causes information about cer¬ 
tain treatment effects (usually high-order interactions) to be indistinguishable from, or 
confounded with, blocks. In this chapter we concentrate on confounding systems for 
the 2 k factorial design. Note that even though the designs presented are incomplete block 
designs because each block does not contain all the treatments or treatment combinations, 
the special structure of the 2 k factorial system allows a simplified method of analysis. 

We consider the construction and analysis of the 2 k factorial design in 2 P incomplete 
blocks, where p < k. Consequently, these designs can be run in two blocks, four blocks, 
eight blocks, and so on. 
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7-4 CONFOUNDING THE 2 k FACTORIAL DESIGN 
IN TWO BLOCKS 

Suppose that we wish to run a single replicate of the 2 2 design. Each of the 2 2 = 4 
treatment combinations requires a quantity of raw material, for example, and each batch 
of raw material is only large enough for two treatment combinations to be tested. Thus, 
two batches of raw material are required. If batches of raw material are considered as 
blocks, then we must assign two of the four treatment combinations to each block. 

Figure 7-1 shows one possible design for this problem. The geometric view, Figure 
1-la, indicates that treatment combinations on opposing diagonals are assigned to dif¬ 
ferent blocks. Notice from Figure 1-lb that block 1 contains the treatment combinations 
(1) and ab and that block 2 contains a and b. Of course, the order in which the treatment 
combinations are run within a block is randomly determined. We would also randomly 
decide which block to run first. Suppose we estimate the main effects of A and B just as 
if no blocking had occurred. From Equations 6-1 and 6-2, we obtain 

A = \[ab + a-b- (1)] 

B = \[ab + b - a - (1)] 

Note that both A and B are unaffected by blocking because in each estimate there is one 
plus and one minus treatment combination from each block. That is, any difference 
between block 1 and block 2 will cancel out. 

Now consider the AB interaction 

AB = \[ab + (1) - a - b] 

Because the two treatment combinations with the plus sign [ab and (1)] are in block 1 
and the two with the minus sign ( a and b) are in block 2, the block effect and the AB 
interaction are identical. That is, AB is confounded with blocks. 

The reason for this is apparent from the table of plus and minus signs for the 2 2 
design. This was originally given as Table 6-2, but for convenience it is reproduced as 
Table 7-3 on the next page. From this table, we see that all treatment combinations that 
have a plus sign on AB are assigned to block 1, whereas all treatment combinations that 


• = Run in block 1 
O = Run in block 2 


A 

(o) Geometric view 


Block 1 Block 2 

ph pp 

ab 6 

( b) Assignment of the four 
runs to two blocks 

Figure 7-1 A 2 2 design in two blocks. 
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Table 7-3 Table of Plus and Minus Signs for the 2 2 Design 


Treatment 

Combination 


Factorial Effect 


I 

A 

B 

AB 

(1) 

+ 

- 

- 

+ 

a 

+ 

+ 

- 

- 

b 

+ 

- 

+ 

- 

ab 

+ 

+ 

+ 

+ 


have a minus sign on AB are assigned to block 2. This approach can be used to confound 
any effect (A, B, or AB) with blocks. For example, if (1) and b had been assigned to 
block 1 and a and ab to block 2, the main effect A would have been confounded with 
blocks. The usual practice is to confound the highest-order interaction with blocks. 

This scheme can be used to confound any 2 k design in two blocks. As a second 
example, consider a 2 3 design run in two blocks. Suppose we wish to confound the three- 
factor interaction ABC with blocks. From the table of plus and minus signs shown in 
Table 7-4, we assign the treatment combinations that are minus on ABC to block 1 and 
those that are plus on ABC to block 2. The resulting design is shown in Figure 7-2. Once 
again, we emphasize that the treatment combinations within a block are run in random 
order. 


Other Methods for Constructing the Blocks 

There is another method for constructing these designs. The method uses the linear 
combination 


L = a 1 x l + a 2 x 2 + • • • + a k x k (7-1) 

where x t is the level of the /'th factor appearing in a particular treatment combination and 
a, is the exponent appearing on the /th factor in the effect to be confounded. For the 2 k 
system, we have a, = 0 or 1 and x t = 0 (low level) or x, = 1 (high level). Equation 7-1 
is called a defining contrast. Treatment combinations that produce the same value of L 
(mod 2) will be placed in the same block. Because the only possible values of L (mod 
2) are 0 and 1, this will assign the 2 k treatment combinations to exactly two blocks. 

To illustrate the approach, consider a 2 3 design with ABC confounded with blocks. 


Table 7-4 Table of Plus and Minus Signs for the 2 3 Design 


Treatment 

Combination 




Factorial Effect 




I 

A 

B 

AB 

C 

AC 

BC 

ABC 

(1) 

+ 

- 

- 

+ 

- 

+ 

+ 

- 

a 

+ 

+ 

- 

- 

- 

- 

+ 

+ 

b 

+ 

- 

+ 

- 

- 

+ 

- 

+ 

ab 

+ 

+ 

+ 

+ 

- 

- 

- 

- 

c 

+ 

- 

- 

+ 

+ 

- 

- 

+ 

ac 

+ 

+ 

- 

- 

+ 

+ 

- 

- 

be 

+ 

- 

+ 

- 

+ 

- 

+ 

- 

abc 

+ 

+ 

+ 

+ 

+ 

+ 

+ 

+ 
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(6) Assignment of the eight 
runs to two blocks 

Figure 7-2 The 2 3 design in two blocks with ABC 
confounded. 


Here x 3 corresponds to A, x 2 to B, x 3 to C, and a 1 = a 2 = a 3 - 1. Thus, the defining 
contrast corresponding to ABC is 

L = X 3 + X 2 + X 3 

The treatment combination (1) is written 000 in the (0, 1) notation; therefore, 

L = 1(0) + 1(0) + 1(0) = 0 = 0 (mod 2) 

Similarly, the treatment combination a is 100, yielding 

L = 1(1) + 1(0) + 1(0) =1 = 1 (mod 2) 

Thus, (1) and a would be run in different blocks. For the remaining treatment combi¬ 
nations, we have 

b: L = 1(0) + 1(1) + 1(0) =1 = 1 (mod 2) 
ab: L = 1(1) + 1(1) + 1(0) = 2 = 0 (mod 2) 

c: L = 1(0) + 1(0) + 1(1) =1 = 1 (mod 2) 

ac: L = 1(1) + 1(0) + 1(1) = 2 = 0 (mod 2) 

be: L = 1(0) + 1(1) + 1(1) = 2 = 0 (mod 2) 

abc: L = 1(1) + 1(1) + 1(1) = 3 = 1 (mod 2) 

Thus (1), ab, ac, and be are run in block 1 and a, b, c, and abc are run in block 2. This 
is the same design shown in Figure 7-2, which was generated from the table of plus and 
minus signs. 

Another method may be used to construct these designs. The block containing the 
treatment combination (1) is called the principal block. The treatment combinations in 
this block have a useful group-theoretic property; namely, they form a group with respect 
to multiplication modulus 2. This implies that any element [except (1)] in the principal 
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block may be generated by multiplying two other elements in the principal block modulus 
2. For example, consider the principal block of the 2 3 design with ABC confounded, as 
shown in Figure 7-2. Note that 

ab • ac = a 2 bc = be 
ab • be = ab 2 c = ac 
ac • be = abc 2 = ab 

Treatment combinations in the other block (or blocks) may be generated by multiplying 
one element in the new block by each element in the principal block modulus 2. For the 
2 3 with ABC confounded, because the principal block is (1), ab, ac, and be, we know 
that b is in the other block. Thus, the elements of this second block are 
b • (1) = b 

b • ab = ab 2 = a 
b • ac = abc 

b • be = b 2 c - c 

This agrees with the results obtained previously. 

Estimation of Error 

When the number of variables is small, say k = 2 or 3, it is usually necessary to replicate 
the experiment to obtain an estimate of error. For example, suppose that a 2 3 factorial 
must be run in two blocks with ABC confounded, and the experimenter decides to rep¬ 
licate the design four times. The resulting design might appear as in Figure 7-3. Note 
that ABC is confounded in each replicate. 

The analysis of variance for this design is shown in Table 7-5. There are 32 obser¬ 
vations and 31 total degrees of freedom. Furthermore, because there are eight blocks, 
seven degrees of freedom must be associated with these blocks. One breakdown of those 
seven degrees of freedom is shown in Table 7-5. The error sum of squares actually 
consists of the two-factor interactions between replicates and each of the effects (A, B, 
C, AB, AC, BC). It is usually safe to consider the interactions to be zero and to treat the 
resulting mean square as an estimate of error. Main effects and two-factor interactions 
are tested against the mean square error. Cochran and Cox (1957) observe that the block 
or ABC mean square could be compared to the error for the ABC mean square, which is 
really replicates X blocks. This test usually is very insensitive. 

If resources are sufficient to allow the replication of confounded designs, it is gen¬ 
erally better to use a slightly different method of designing the blocks in each replicate. 
This approach consists of confounding a different effect in each replicate so that some 


Replicate I 
Block 1 Block 2 


Replicate II 
Block 1 Block 2 


Replicate II 
Block 1 Blc 


Figure 7-3 Four replicates of the 2 3 design with ABC confounded. 
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Table 7-5 Analysis of Variance for Four Replicates 
of a 2 3 Design with ABC Confounded 


Source of Variation 

Degrees of 
Freedom 

Replicates 

3 

Blocks (ABC) 

1 

Error for ABC (replicates X blocks) 

3 

A 

1 

B 

1 

C 

1 

AB 

1 

AC 

1 

BC 

1 

Error (or replicates X effects) 

18 

Total 

31 


information on all effects is obtained. Such a procedure is called partial confounding 
and is discussed in Section 7-7. If k is moderately large, say £3= 4, we frequently can 
afford only a single replicate. The experimenter usually assumes higher-order interactions 
to be negligible and combines their sums of squares as error. The normal probability plot 
of factor effects can be very helpful in this regard. 

EXAMPLE 7-2 . 

Consider the situation described in Example 6-2. Recall that four factors—temperature 
(A), pressure (B), concentration of formaldehyde (C), and stirring rate (D)—are studied 
in a pilot plant to determine their effect on product filtration rate. We will use this 
experiment to illustrate the ideas of blocking and confounding in an unreplicated design. 
We will make two modifications to the original experiment. First, suppose that the 2 4 = 
16 treatment combinations cannot all be run using one batch of raw material. The ex¬ 
perimenter can run eight treatment combinations from a single batch of material, so a 2 4 
design confounded in two blocks seems appropriate. It is logical to confound the highest- 
order interaction ABCD with blocks. The defining contrast is 

L = X\ + x 2 + x 3 + x 4 

and it is easy to verify that the design is as shown in Figure 7-4 (on the next page). 
Alternatively, one may examine Table 6-11 and observe that the treatment combinations 
that are + in the ABCD column are assigned to block 1 and those that are — in ABCD 
column are in block 2. 

The second modification that we will make is to introduce a block effect so that the 
utility of blocking can be demonstrated. Suppose that when we select the two batches of 
raw material required to run the experiment, one of them is of much poorer quality and, 
as a result, all responses will be 20 units lower in this material batch than in the other. 
The poor quality batch becomes block 1 and the good quality batch becomes block 2 (it 
doesn’t matter which batch is called block 1 or which batch is called block 2). Now all 
the tests in block 1 are performed first (the eight runs in the block are, of course, per¬ 
formed in random order), but the responses are 20 units lower than they would have been 
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• = Runs in block 1 
o = Runs in block 2 

(a) Geometric view 



Block 1 


(1) = 25 
ab = 45 
ac = 40 
6c = 60 
ad = 80 
bd = 25 
cd = 55 
abed = 76 


Block 2 


o = 71 
6 = 48 
c = 68 
d = 43 
abc = 65 
bed = 70 

abd = 104 


(6) Assignment of the 16 runs 
to two blocks 

Figure 7-4 The 2 4 design in two blocks for Example 7-2. 


if good quality material had been used. Figure 7-4 b shows the resulting responses—note 
that these have been found by subtracting the block effect from the original observations 
given in Example 6-2. That is, the original response for treatment combination (1) was 
45, and in Figure 7-4 b it is reported as (1) = 25 (= 45 - 20). The other responses in 
this block are obtained similarly. After the tests in block 1 are performed, the eight tests 
in block 2 follow. There is no problem with the raw material in this batch, so the responses 
are exactly as they were originally in Example 6-2. 

The effect estimates for this “modified” version of Example 6-2 are shown in Table 
7-6. Note that the estimates of the four main effects, the six two-factor interactions, and 
the four three-factor interactions are identical to the effect estimates obtained in Example 
6-2 where there was no block effect. When a normal probability of these effect estimates 
is constructed, factors A, C, D and the AC and AD interactions emerge as the important 
effects, just as in the original experiment. (The reader should verify this.) 

What about the ABCD interaction effect? The estimate of this effect in the original 
experiment (Example 6-2) was ABCD = 1.375. In the present example, the estimate of 
the ABCD interaction effect is ABCD = —18.625. Because ABCD is confounded with 
blocks, the ABCD interaction estimates the original interaction effect (1.375) plus the 
block effect (-20), so ABCD = 1.375 + (-20) = -18.625. (Do you see why the block 
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Table 7-6 Effect Estimates for the Blocked 2 4 Design in Example 7-2 


Model Term 

Regression 

Coefficient 

Effect 

Estimate 

Sum of 
Squares 

Percent 

Contribution 

A 

10.81 

21.625 

1870.5625 

26.30 

B 

1.56 

3.125 

39.0625 

0.55 

C 

4.94 

9.875 

390.0625 

5.49 

D 

7.31 

14.625 

855.5625 

12.03 

AB 

0.062 

0.125 

0.0625 

<0.01 

AC 

-9.06 

-18.125 

1314.0625 

18.48 

AD 

8.31 

16.625 

1105.5625 

15.55 

BC 

1.19 

2.375 

22.5625 

0.32 

BD 

-0.19 

-0.375 

0.5625 

<0.01 

CD 

-0.56 

-1.125 

5.0625 

0.07 

ABC 

0.94 

1.875 

14.0625 

0.20 

ABD 

2.06 

4.125 

68.0625 

0.96 

ACD 

-0.81 

-1.625 

10.5625 

0.15 

BCD 

-1.31 

-2.625 

27.5625 

0.39 

Blocks (ABCD) 

-18.625 

1387.5625 

19.51 


effect is -20?) The block effect may also be calculated directly as the difference in 
average response between the two blocks, or 

Block Effect = y B iock i “ ymock 2 
406 555 

8 8 
= -149 
8 

= -18.625 


Of course, this effect really estimates Blocks + ABCD. 

Table 7-7 summarizes the analysis of variance for this experiment. The effects with 
large estimates are included in the model, and the block sum of squares is 


(406) 2 + (555) 2 (961) 2 


Table 7-7 Analysis of Variance for Example 7-2 
Source of Sum of Degrees of 

Variation Squares Freedom 

Blocks (ABCD) 1387.5625 1 

A 1870.5625 1 

C 390.0625 1 

D 855.5625 1 

AC 1314.0625 1 

AD 1105.5625 1 

Error 187.5625 9 

Total 7111.4375 


Mean 

Square F 0 P -Value 


1870.5625 89.76 <0.0001 

390.0625 18.72 0.0019 

855.5625 41.05 0.0001 

1314.0625 63.05 <0.0001 

1105.5625 53.05 <0.0001 

20.8403 


15 
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The conclusions from this experiment exactly match those from Example 6-2, where no 
block effect was present. Notice that if the experiment had not been run in blocks, and 
if an effect of magnitude -20 had affected the first 8 trials (which would have been 
selected in a random fashion, because the 16 trials would be run in random order in an 
unblocked design), the results could have been very different. 


7-5 CONFOUNDING THE 2 k FACTORIAL DESIGN 
IN FOUR BLOCKS 

It is possible to construct 2* factorial designs confounded in four blocks of 2 k ~ 2 obser¬ 
vations each. These designs are particularly useful in situations where the number of 
factors is moderately large, say k 4, and block sizes are relatively small. 

As an example, consider the 2 5 design. If each block will hold only eight runs, then 
four blocks must be used. The construction of this design is relatively straightforward. 
Select two effects to be confounded with blocks, say ADE and BCE. These effects have 
the two defining contrasts 

Lj = x, + x 4 + x 5 
L 2 = X2 + x 3 + X 5 

associated with them. Now every treatment combination will yield a particular pair of 
values of L x (mod 2) and L 2 (mod 2), that is, either (L u L 2 ) = (0, 0), (0, 1), (1, 0), or 
(1, 1). Treatment combinations yielding the same values of (L u L 2 ) are assigned to the 
same block. In our example we find 

Li = 0, L 2 = 0 for (1), ad, be, abed, abe, ace, ede, bde 

L\ = 1, L 2 = 0 for a, d, abe, bed, be, abde, ce, aede 

L x = 0, L 2 = 1 for b, abd, c, acd, ae, de, abce, bede 

Lx = \,L 2 — 1 for e, ade, bee, abede, ab, bd, ac, cd 

These treatment combinations would be assigned to different blocks. The complete de¬ 
sign is as shown in Figure 7-5. 

With a little reflection we realize that another effect in addition to ADE and BCE 
must be confounded with blocks. Because there are four blocks with three degrees of 
freedom between them, and because ADE and BCE have only one degree of freedom 
each, clearly an additional effect with one degree of freedom must be confounded. This 
effect is the generalized interaction of ADE and BCE, which is defined as the product 


Block 1 Block 2 Block 3 Block 4 

L, = 0 L, = 1 L, = 0 L, = 1 

L 2 = 0 L 2 = 0 L 2 = 1 L 2 = 1 



Figure 7-5 The 2 5 design in four blocks with ADE, 
BCE, and ABCD confounded. 




7-6 CONFOUNDING THE 2 k FACTORIAL DESIGN IN 2 P BLOCKS 


297 


of ADE and BCE modulus 2. Thus, in our example the generalized interaction 
( ADE)(BCE ) = ABCDE 2 = ABCD is also confounded with blocks. It is easy to verify 
this by referring to a table of plus and minus signs for the 2 5 design, such as in Davies 
(1956). Inspection of such a table reveals that the treatment combinations are assigned 
to the blocks as follows: 


Treatment Combinations in Sign on ADE Sign on BCE Sign on ABCD 


Block 1 
Block 2 
Block 3 
Block 4 


Notice that the product of signs of any two effects for a particular block (e.g., ADE and 
BCE) yields the sign of the other effect for that block (in this case, ABCD). Thus, ADE, 
BCE, and ABCD are all confounded with blocks. 

The group-theoretic properties of the principal block mentioned in Section 7-4 still 
hold. For example, we see that the product of two treatment combinations in the principal 
block yields another element of the principal block. That is, 

ad • be = abed and abe - bde = ab 2 de 2 = ad 

and so forth. To construct another block, select a treatment combination that is not in 
the principal block (e.g., b) and multiply b by all the treatment combinations in the 
principal block. This yields 

b • (1) = b b • ad = abd b • be = b 2 c = c b • abed = ab 2 cd = acd 

and so forth, which will produce the eight treatment combinations in block 3. In practice, 
the principal block can be obtained from the defining contrasts and the group-theoretic 
property, and the remaining blocks can be determined from these treatment combinations 
by the method shown above. 

The general procedure for constructing a 2 k design confounded in four blocks is to 
choose two effects to generate the blocks, automatically confounding a third effect that 
is the generalized interaction of the first two. Then, the design is constructed by using 
the two defining contrasts {L u L 2 ) and the group-theoretic properties of the principal 
block. In selecting effects to be confounded with blocks, care must be exercised to obtain 
a design that does not confound effects that may be of interest. For example, in a 2 5 
design we might choose to confound ABCDE and ABD, which automatically confounds 
CE, an effect that is probably of interest. A better choice is to confound ADE and BCE, 
which automatically confounds ABCD. It is preferable to sacrifice information on the 
three-factor interactions ADE and BCE instead of the two-factor interaction CE. 


7-6 CONFOUNDING THE 2 k FACTORIAL DESIGN IN 2 P BLOCKS 

The methods described above may be extended to the construction of a 2 k factorial design 
confounded in 2 P blocks (p < k), where each block conta ins exactly 2 k ~ p runs. We select 
p independent effects to be confounded, where by “independent” we mean that no effect 
chosen is the generalized interaction of the others. The blocks may be generated by use 


Suggested Blocking Arrangements for the 2 k Factorial Design 
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of the p defining contrasts L x , L 2 ,..., L p associated with these effects. In addition, exactly 
2 P - p - 1 other effects will be confounded with blocks, these being the generalized 
interactions of those p independent effects initially chosen. Care should be exercised in 
selecting effects to be confounded so that information on effects that may be of potential 
interest is not sacrificed. 

The statistical analysis of these designs is straightforward. Sums of squares for all 
the effects are computed as if no blocking had occurred. Then, the block sum of squares 
is found by adding the sums of squares for all the effects confounded with blocks. 

Obviously, the choice of the p effects used to generate the block is critical because 
the confounding structure of the design directly depends on them. Table 7-8 (on the 
facing page) presents a list of useful designs. To illustrate the use of this table, suppose 
we wish to construct a 2 6 design confounded in 2 3 = 8 blocks of 2 3 = 8 runs each. Table 
7-8 indicates that we would choose ABEF, ABCD, and ACE as the p — 3 independent 
effects to generate the blocks. The remaining 2 P - p - 1 = 2 3 — 3-1=4 effects that 
are confounded are the generalized interactions of these three; that is, 

(ABEF){ABCD) = A 2 B 2 CDEF = CDEF 
(ABEF)(ACE) = A 2 BCE 2 F = BCF 
(ABCD)(ACE) = A 2 BC 2 ED = BDE 
(ABEF)(ABCD)(ACE) = A 3 B 2 C 2 DE 2 F = ADF 

The reader is asked to generate the eight blocks for this design in Problem 7-11. 


7-7 PARTIAL CONFOUNDING 

We remarked in Section 7-4 that, unless experimenters have a prior estimate of error or 
are willing to assume certain interactions to be negligible, they must replicate the design 
to obtain an estimate of error. Figure 7-3 shows a 2 3 factorial in two blocks with ABC 
confounded, replicated four times. From the analysis of variance for this design, shown 
in Table 7-5, we note that information on the ABC interaction cannot be retrieved because 
ABC is confounded with blocks in each replicate. This design is said to be completely 
confounded. 

Consider the alternative shown in Figure 7-6. Once again, there are four replicates 
of the 2 3 design, but a different interaction has been confounded in each replicate. That 
is, ABC is confounded in replicate I, AB is confounded in replicate II, BC is confounded 
in replicate III, and AC is confounded in replicate IV. As a result, information on ABC 


Replicate I 


ABC Confounded 



Replicate n 
AB Confounded 


Replicate III 
BC Confounded 


Replicate IV 
AC Confounded 


Figure 7-6 Partial confounding in the 2’ design. 
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Table 7-9 Analysis of Variance for a Partially Confounded 2 3 Design 


Source of Variation 

Degrees of 
Freedom 

Replicates 

3 

Blocks within replicates [or ABC (rep. I) + AB (rep. II) 

+ BC (rep. Ill) + AC (rep. IV)] 

4 

A 

1 

B 

1 

C 

1 

AB (from replicates I, III, and IV) 

1 

AC (from replicates I, n, and IH) 

1 

BC (from replicates I, II, and IV) 

1 

ABC (from replicates II, HI, and IV) 

1 

Error 

17 

Total 

31 


can be obtained from the data in replicates II, III, and IV; information on AB can be 
obtained from replicates I, III, and IV; information on AC can be obtained from replicates 
I, II, and III; and information on BC can be obtained from replicates I, II, and IV. We 
say that three-quarters information can be obtained on the interactions because they are 
unconfounded in only three replicates. Yates (1937) calls the ratio 3/4 the relative in¬ 
formation for the confounded effects. This design is said to be partially confounded. 

The analysis of variance for this design is shown in Table 7-9. In calculating the 
interaction sums of squares, only data from the replicates in which an interaction is 
unconfounded are used. The error sum of squares consists of replicates X main effect 
sums of squares plus replicates X interaction sums of squares for each replicate in which 
that interaction is unconfounded (e.g., replicates X ABC for replicates II, III, and IV). 
Furthermore, there are seven degrees of freedom among the eight blocks. This is usually 
partitioned into three degrees of freedom for replicates and four degrees of freedom for 
blocks within replicates. The composition of the sum of squares for blocks is shown in 
Table 7-9 and follows directly from the choice of the effect confounded in each replicate. 


EXAMPLE 7-3 . 

A 2 3 Design with Partial Confounding 

Consider Example 6-1, in which a study was performed to determine the effect of per¬ 
centage of carbonation (A), operating pressure (B), and line speed (C) on the fill height 
of a carbonated beverage. Suppose that each batch of syrup is only large enough to test 
four treatment combinations. Thus, each replicate of the 2 3 design must be run in two 
blocks. Two replicates are run, with ABC confounded in replicate I and AB confounded 
in replicate II. The data are as follows: 


Replicate I 
ABC Confounded 


Replicate II 
AB Confounded 



(1) = 

-1 

c = 

0 

ab = 

3 

abc = 

5 


b = 0 
be = 1 
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Table 7-10 Analysis of Variance for Example 7-3 


Source of Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

F 0 

P-Value 

Replicates 

1.00 

1 

1.00 

— 


Blocks within replicates 

2.50 

2 

1.25 

— 


A 

36.00 

1 

36.00 

48.00 

0.0001 

B 

20.25 

1 

20.25 

27.00 

0.0035 

C 

12.25 

1 

12.25 

16.33 

0.0099 

AB (rep. I only) 

0.50 

1 

0.50 

0.67 

0.4503 

AC 

0.25 

1 

0.25 

0.33 

0.5905 

BC 

1.00 

1 

1.00 

1.33 

0.3009 

ABC (rep. II only) 

0.50 

1 

0.50 

0.67 

0.4503 

Error 

3.75 

5 

0.75 

— 


Total 

78.00 

15 

- 

- 



The sums of squares for A, B, C, AC, and BC may be calculated in the usual manner, 
using all 16 observations. However, we must find SS ABC using only the data in replicate 
II and SS AB using only the data in replicate I as follows: 


[a + b + c + abc - ab - ac — be - (l)] 2 


[1 + 0 + 0 + 5 — 3 — 1 — 1— (-1)] 2 


(D(8) 


= [-3 + 6 - 2 + (-1) - 0 - (-1) + 2 - l] 2 

(D(8) 

The sum of squares for the replicates is, in general, 

Rep " & 2 k N 

_ ( 6) 2 + ( 10) 2 _ ( 16) 2 _ 


where R h is the total of the observations in the Mi replicate. The block sum of squares 
is the sum of SS ABC from replicate I and SS AB from replicate H, or SSuiocks — 2.50. 

The analysis of variance is summarized in Table 7-10. All three main effects are 
important. 


7-8 PROBLEMS - 

7-1. Consider the experiment described in Problem 6-1. Analyze this experiment assuming that 
each replicate represents a block of a single production shift. 

7-2. Consider the experiment described in Problem 6-5. Analyze this experiment assuming that 
each one of the four replicates represents a block. 

7-3. Consider the alloy cracking experiment described in Problem 6-15. Suppose that only 16 
runs could be made on a single day, so each replicate was treated as a block. Analyze the 
experiment and draw conclusions. 
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7-4. Consider the data from the first replicate of Problem 6-1. Suppose that these observations 
could not all be run using the same bar stock. Set up a design to run these observations 
in two blocks of four observations each with ABC confounded. Analyze the data. 

7-5. Consider the data from the first replicate of Problem 6-7. Construct a design with two 
blocks of eight observations each with ABCD confounded. Analyze the data. 

7-6. Repeat Problem 7-5 assuming that four blocks are required. Confound ABD and ABC (and 
consequently CD) with blocks. 

7-7. Using the data from the 2 5 design in Problem 6-21, construct and analyze a design in two 
blocks with ABCDE confounded with blocks. 

7-8. Repeat Problem 7-7 assuming that four blocks are necessary. Suggest a reasonable con¬ 
founding scheme. 

7-9. Consider the data from the 2 3 design in Problem 6-21. Suppose that it was necessary to 
run this design in four blocks with ACDE and BCD (and consequently ABE) confounded. 
Analyze the data from this design. 

7-10. Design an experiment for confounding a 2 6 factorial in four blocks. Suggest an appropriate 
confounding scheme, different from the one shown in Table 7-8. 

7-11. Consider the 2 6 design in eight blocks of eight runs each with ABCD, ACE, and ABEF as 
the independent effects chosen to be confounded with blocks. Generate the design. Find 
the other effects confounded with blocks. 

7-12. Consider the 2 2 design in two blocks with AB confounded. Prove algebraically that 
SS AB = SS Bl0 ck S . 

7-13. Consider the data in Example 7-2. Suppose that all the observations in block 2 are in¬ 
creased by 20. Analyze the data that would result. Estimate the block effect. Can you 
explain its magnitude? Do blocks now appear to be an important factor? Are any other 
effect estimates impacted by the change you made to the data? 

7-14 Suppose that in Problem 6-1 we had confounded ABC in replicate I, AB in replicate II, 
and BC in replicate HI. Calculate the factor effect estimates. Construct the analysis of 
variance table. 

7-15. Repeat Problem 6-1 assuming that ABC was confounded with blocks in each replicate. 

7-16. Suppose that in Problem 6-7 ABCD was confounded in replicate I and ABC was con¬ 
founded in replicate II. Perform the statistical analysis of this design. 

7-17. Construct a 2 3 design with ABC confounded in the first two replicates and BC confounded 

in the third. Outline the analysis of variance and comment on the information obtained. 




Two-Level Fractional 
Factorial Designs 


8-1 INTRODUCTION 

As the number of factors in a 2* factorial design increases, the number of runs required 
for a complete replicate of the design rapidly outgrows the resources of most experi¬ 
menters. For example, a complete replicate of the 2 6 design requires 64 runs. In this 
design only 6 of the 63 degrees of freedom correspond to main effects, and only 15 
degrees of freedom correspond to two-factor interactions. The remaining 42 degrees of 
freedom are associated with three-factor and higher interactions. 

If the experimenter can reasonably assume that certain high-order interactions are 
negligible, information on the main effects and low-order interactions may be obtained 
by running only a fraction of the complete factorial experiment. These fractional fac¬ 
torial designs are among the most widely used types of designs for product and process 
design and for process improvement. 

A major use of fractional factorials is in screening experiments. These are experi¬ 
ments in which many factors are considered and the objective is to identify those factors 
(if any) that have large effects. Screening experiments are usually performed in the early 
stages of a project when it is likely that many of the factors initially considered have 
little or no effect on the response. The factors that are identified as important are then 
investigated more thoroughly in subsequent experiments. 

The successful use of fractional factorial designs is based on three key ideas: 

1. The sparsity of effects principle. When there are several variables, the system 
or process is likely to be driven primarily by some of the main effects and low- 
order interactions. 

2. The projection property. Fractional factorial designs can be projected into 
stronger (larger) designs in the subset of significant factors. 

3. Sequential experimentation. It is possible to combine the runs of two (or more) 
fractional factorials to assemble sequentially a larger design to estimate the factor 
effects and interactions of interest. 
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We will focus on these principles in this chapter and illustrate them with several 
examples. 

8-2 THE ONE-HALF FRACTION OF THE 2 k DESIGN 

Consider a situation in which three factors, each at two levels, are of interest, but the 
experimenters cannot afford to run all 2 3 = 8 treatment combinations. They can, however, 
afford four runs. This suggests a one-half fraction of a 2 3 design. Because the design 
contains 2 3_1 = 4 treatment combinations, a one-half fraction of the 2 3 design is often 
called a 2 3- ' design. 

The table of plus and minus signs for the 2 3 design is shown in Table 8-1. Suppose 
we select the four treatment combinations a, b, c, and abc as our one-half fraction. These 
runs are shown in the top half of Table 8-1 and in Figure 8-la. 

Notice that the 2 31 design is formed by selecting only those treatment combinations 
that have a plus in the ABC column. Thus, ABC is called the generator of this particular 
fraction. Sometimes we will refer to a generator such as ABC as a word. Furthermore, 
the identity column I is also always plus, so we call 
I = ABC 

the defining relation for our design. In general, the defining relation for a fractional 
factorial will always be the set of all columns that are equal to the identity column /. 

The treatment combinations in the 2 3_1 design yield three degrees of freedom that 
we may use to estimate the main effects. Referring to Table 8-1, we note that the linear 
combinations of the observations used to estimate the main effects of A, B, and C are 
t A = \{a — b - c + abc) 
i B = \(-a + b — c + abc ) 

Z c = |(-a - b + c + abc) 

It is also easy to verify that the linear combinations of the observations used to estimate 
the two-factor interactions are 

( BC = s(a - b - c + abc) 

Zac = 2 (~a + b - c + abc) 

Zab — \i~ a — b + c + abc) 


Table 8-1 Plus and Minus Signs for the 2 3 Factorial Design 


Treatment 

Combination 




Factorial Effect 




/ 

A 

B 

C 

AB 

AC 

BC 

ABC 

a 

+ 

+ 

- 

- 

- 

- 

+ 

+ 

b 

+ 

- 

+ 

- 

- 

+ 

- 

+ 

c 

+ 

- 

- 

+ 

+ 

- 

- 

+ 

abc 

+ 

+ 

+ 

+ 

+ 

+ 

+ 

+ 

ab 

+ 

+ 

+ 

- 

+ 

- 

- 

- 

ac 

+ 

+ 

- 

+ 

- 

+ 

- 

- 

be 

+ 

- 

+ 

+ 

- 

- 

+ 

- 

(1) 

+ 

- 

- 

- 

+ 

+ 

+ 

- 
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(6) The alternate fraction, I = -ABC 

Figure 8-1 The two one-half fractions of the 2 3 

design. 


Thus, € A = € bc , € s = £ ac , and l c = (ab\ consequently, it is impossible to differentiate 
between A and BC, B and AC, and C and AB. In fact, when we estimate A, B, and C we 
are really estimating A + BC, B + AC, and C + AB. Two or more effects that have this 
property are called aliases. In our example, A and BC are aliases, B and AC are aliases, 
and C and AB are aliases. We indicate this by the notation i A — * A + BC, ( B —» B + 
AC, and » C + AB. 

The alias structure for this design may be easily determined by using the defining 
relation I = ABC. Multiplying any column (or effect) by the defining relation yields the 
aliases for that column (or effect). In our example, this yields as the alias of A 

A • I = A • ABC = A 2 BC 

or, because the square of any column is just the identity I, 

A = BC 

Similarly, we find the aliases of B and C as 

B ' I = B • ABC 
B = AB 2 C = AC 


and 


C ■ 1 = C • ABC 
C = ABC 2 = AB 

This one-half fraction, with 1 = +ABC, is usually called the principal fraction. 

Now suppose that we had chosen the other one-half fraction, that is, the treatment 
combinations in Table 8-1 associated with minus in the ABC column. This alternate, or 
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complementary, one-half fraction (consisting of the runs (1), ab, ac, and be) is shown 
in Figure 8-1 b. The defining relation for this design is 
I = —ABC 

The linear combination of the observations, say i' A , ig, and i' c , from the alternate fraction 
gives us 

A-BC 
tg B — AC 
Vc^C - AB 

Thus, when we estimate A , B, and C with this particular fraction, we are really estimating 
A - BC, B — AC, and C - AB. 

In practice, it does not matter which fraction is actually used. Both fractions belong 
to the same family ; that is, the two one-half fractions form a complete 2 3 design. This 
is easily seen by reference to parts a and b of Figure 8-1. 

Suppose that after running one of the one-half fractions of the 2 3 design, the other 
one was also run. Thus, all eight runs associated with the full 2 3 are now available. We 
may now obtain de-aliased estimates of all the effects by analyzing the eight runs as a 
full 2 3 design in two blocks of four runs each. This could also be done by adding and 
subtracting the linear combination of effects from the two individual fractions. For ex¬ 
ample, consider i A — > A + BC and € A —* A — BC. This implies that 
+ €A) = \(A + BC + A- BC)^ A 

and that 

±(€„ “ *a) = 5 (A + BC - A + BC) -» BC 
Thus, for all three pairs of linear combinations, we would obtain the following: 


i 

From + (',) 

From j(€, — i\) 

A 

A 

BC 

B 

B 

AC 

C 

C 

AB 


Design Resolution 

The preceding 2 3_l design is called a resolution III design. In such a design, main 
effects are aliased with two-factor interactions. A design is of resolution R if no p-factor 
effect is aliased with another effect containing less than R - p factors. We usually employ 
a Roman numeral subscript to denote design resolution; thus, the one-half fraction of the 
2 3 design with the defining relation I = ABC (or / = -ABC) is a 2 3 n 1 design. 

Designs of resolution in, IV, and V are particularly important. The definitions of 
these designs and an example of each follow: 

1. Resolution III designs. These are designs in which no main effects are aliased 
with any other main effect, but main effects are aliased with two-factor 
interactions and two-factor interactions may be aliased with each other. The 2 3 1 
design in Table 8-1 is of resolution IH (2 3 n'). 

2 . Resolution IV designs. These are designs in which no main effect is aliased 
with any other main effect or with any two-factor interaction, but two-factor 
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interactions are aliased with each other. A 2 4 1 design with I = ABCD is a 
resolution IV design (2^). 

3. Resolution V designs. These are designs in which no main effect or two-factor 
interaction is aliased with any other main effect or two-factor interaction, but 
two-factor interactions are aliased with three-factor interactions. A 2 5_1 design 
with I = ABCDE is a resolution V design (2V 1 ). 

In general, the resolution of a two-level fractional factorial design is equal to the 
smallest number of letters in any word in the defining relation. Consequently, we could 
call the preceding design types three-, four-, and five-letter designs, respectively. We 
usually like to employ fractional designs that have the highest possible resolution con¬ 
sistent with the degree of fractionation required. The higher the resolution, the less re¬ 
strictive the assumptions that are required regarding which interactions are negligible to 
obtain a unique interpretation of the data. 

Constructing One-Half Fractions 

A one-half fraction of the 2* design of the highest resolution may be constructed by 
writing down a basic design consisting of the runs for a full 2 k ~ 1 factorial and then 
adding the Mi factor by identifying its plus and minus levels with the plus and minus 
signs of the highest-order interaction ABC ■ • • (K — 1). Therefore, the 2, 3 n 1 fractional 
factorial is obtained by writing down the full 2 2 factorial as the basic design and then 
equating factor C to the AB interaction. The alternate fraction would be obtained by 
equating factor C to the -AB interaction. This approach is illustrated in Table 8-2. Notice 
that the basic design always has the right number of runs (rows), but it is missing one 
column. The generator / = ABC • • • K is then solved for the missing column (K) so that 
K = ABC ■ ■ ■ (K - 1) defines the product of plus and minus signs to use in each row to 
produce the levels for the Mi factor. 

Note that any interaction effect could be used to generate the column for the Mi 
factor. However, using any effect other than ABC ■■■ (K — 1) will not produce a design 
of the highest possible resolution. 

Another way to view the construction of a one-half fraction is to partition the runs 
into two blocks with the highest-order interaction ABC • • ■ K confounded. Each block is 
a 2* _1 fractional factorial design of the highest resolution. 

Projection of Fractions into Factorials 

Any fractional factorial design of resolution R contains complete factorial designs (pos¬ 
sibly replicated factorials) in any subset of R — 1 factors. This is an important and useful 
concept. For example, if an experimenter has several factors of potential interest but 


Table 8-2 

The Two One-Half Fractions of the 2 3 

1 Design 



Run 

Full 2 2 
Factorial 
(Basic 
Design) 


2jr\ 1 = ABC 


2m 1 ,/ = -ABC 

A B 

A 

B 

C =AB 

A 

B C = -AB 

1 

2 

+ 

+ 

- 

+ 

+ 

- + 

3 

+ 

- 

+ 

- 

- 

+ + 

4 

+ + 

+ 

+ 

+ 

+ 

+ 



308 CHAPTER 8 TWO-LEVEL FRACTIONAL FACTORIAL DESIGNS 





Figure 8-2 Projection of a 2m' design 
into three 2 2 designs. 


believes that only R - 1 of them have important effects, then a fractional factorial design 
of resolution R is the appropriate choice of design. If the experimenter is correct, the 
fractional factorial design of resolution R will project into a full factorial in the R — 1 
significant factors. This process is illustrated in Figure 8-2 for the 2\ n 1 design, which 
projects into a 2 2 design in every subset of two factors. 

Because the maximum possible resolution of a one-half fraction of the 2 design is 
R = k, every 2 k ~ l design will project into a full factorial in any (k — 1) of the original 
k factors. Furthermore, a 2* _1 design may be projected into two replicates of a full 
factorial in any subset of k - 2 factors, four replicates of a full factorial in any subset 
of k - 3 factors, and so on. 


EXAMPLE 8-1 ... 

Consider the filtration rate experiment in Example 6-2. The original design, shown in 
Table 6-10, is a single replicate of the 2 4 design. In that example, we found that the main 
effects A, C, and D and the interactions AC and AD were different from zero. We will 
now return to this experiment and simulate what would have happened if a half-fraction 
of the 2 4 design had been run instead of the full factorial. 

We will use the 2 4-1 design with I = ABCD, because this choice of generator will 
result in a design of the highest possible resolution (IV). To construct the design, we first 
write down the basic design, which is a 2 3 design, as shown in the first three columns of 
Table 8-3. This basic design has the necessary number of runs (eight) but only three 
columns (factors). To find the fourth factor levels, solve I = ABCD for D, or D = ABC. 
Thus, the level of D in each run is the product of the plus and minus signs in columns 


Table 8-3 The 2?y 1 Design wit h the Defining Relation I - ABCD 
Basic Design 


ABC D= ABC 


+ + 
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Figure 8-3 The 2f v 1 design for the filtration rate experiment of Example 8-1. 


A, B, and C. The process is illustrated in Table 8-3. Because the generator ABCD is 
positive, this 2fv 1 design is the principal fraction. The design is shown graphically in 
Figure 8-3. 

Using the defining relation, we note that each main effect is aliased with a three- 
factor interaction; that is, A = A 2 BCD = BCD, B = AB 2 CD = ACD, C = ABC 2 D = 
ABD, and D = ABCD 2 = ABC. Furthermore, every two-factor interaction is aliased with 
another two-factor interaction. These alias relationships are AB = CD, AC = BD, and 
BC = AD. The four main effects plus the three two-factor interaction alias pairs account 
for the seven degrees of freedom for the design. 

At this point, we would normally randomize the eight runs and perform the exper¬ 
iment. Because we have already run the full 2 4 design, we will simply select the eight 
observed filtration rates from Example 6-2 that correspond to the runs in the 2 4 v 1 design. 
These observations are shown in the last column of Table 8-3 and are also shown in 
Figure 8-3. 

The estimates of the effects obtained from this 2?v 1 design are shown in Table 8-4. 
To illustrate the calculations, the linear combination of observations associated with the 
A effect is 

i A = K-45 + 100 - 45 + 65 - 75 + 60 - 80 + 96) = 19.00 —» A + BCD 
whereas for the AB effect, we would obtain 

( AB = |(45 - 100 - 45 + 65 + 75 - 60 - 80 + 96) = -1.00 -> AB + CD 

From inspection of the information in Table 8-4, it is not unreasonable to conclude that 
the main effects A, C, and D are large. Furthermore, if A, C, and D are the important 
main effects, then it is logical to conclude that the two interaction alias chains AC + BD 

Table 8-4 Estimates of Effects and 

Aliases from Example 8-1“ 

Estimate Alias Structure 

i A = 19.00 (, A -> A + BCD 

e B = 1.50 € b —>B+ ACD 

i c = 14.00 e c -» C + ABD 

e D = 16.50 e D D + ABC 

( AB = -1.00 ( ab ^>AB + CD 

e AC = - 18.50 e AC —» AC + BD 

e AD = 19.00 i AD ~^AD + BC 

* Significant effects are shown in boldface type. 
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75 96 



Figure 8-4 Projection of the 2? v ' design into a 2 3 design in A, C, and 
D for Example 8-1. 


and AD + BC have large effects because the AC and AD interactions are also significant. 
In other words, if A, C, and D are significant then the significant interactions are most 
likely AC and AD. This is an application of Ockham’s razor (after William of Ockham), 
a scientific principle that when one is confronted with several different possible inter¬ 
pretations of a phenomena, the simplest interpretation is usually the correct one. Note 
that this interpretation agrees with the conclusions from the analysis of the complete 2 4 
design in Example 6-2. 

Because factor B is not significant, we may drop it from consideration. Consequently, 
we may project this 2 4 y 1 design into a single replicate of the 2 3 design in factors A, C, 
and D, as shown in Figure 8-4. Visual examination of this cube plot makes us more 
comfortable with the conclusions reached above. Notice that if the temperature (A) is at 
the low level, the concentration (C) has a large positive effect, whereas if the temperature 
is at the high level, the concentration has a very small effect. This is probably due to an 
AC interaction. Furthermore, if the temperature is at the low level, the effect of the stirring 
rate (D) is negligible, whereas if the temperature is at the high level, the stirring rate has 
a large positive effect. This is probably due to the AD interaction tentatively identified 
previously. 

Based on the above analysis, we can now obtain a model to predict filtration rate 
over the experimental region. This model is 

$ = Po + jSl-Tl + P3X3 + p 4 x 4 + p 13 x r x 3 + p l4 x x x 4 


where x u x 3 , and x 4 are coded variables (—1 =£*, =£ + 1) that represent A, C, and D, 
and the P’s are regression coefficients that can be obtained from the effect estimates as 
we did previously. Therefore, the prediction equation is 


y = 70.75 + 




v 2 r 


( 16.50\ / —18.50\ , /19.00\ 

rr-J^+v-r-J ** + r2~r 


Remember that the intercept p 0 is the average of all responses at the eight runs in the 
design. This model is very similar to the one that resulted from the full 2 k factorial design 
in Example 6-2. 
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EXAMPLE 8-2 . 

A 2 s-1 Design Used for Process Improvement 

Five factors in a manufacturing process for an integrated circuit were investigated in a 
2 5_1 design with the objective of improving the process yield. The five factors were 
A = aperture setting (small, large), B = exposure time (20 percent below nominal, 20 
percent above nominal), C = develop time (30 s, 45 s), D = mask dimension (small, 
large), and E = etch time (14.5 min, 15.5 min). The construction of the 2 5_1 design is 
shown in Table 8-5. Notice that the design was constructed by writing down the basic 
design having 16 runs (a 2 4 design in A, B, C, and D), selecting ABCDE as the generator, 
and then setting the levels of the fifth factor £ = ABCD. Figure 8-5 (on page 312) gives 
a pictorial representation of the design. 

The defining relation for the design is I = ABCDE. Consequently, every main effect 
is aliased with a four-factor interaction (for example, £ A —*■ A + BCDE), and every two- 
factor interaction is aliased with a three-factor interaction (for example, C AB —» AB + 
CDE). Thus, the design is of resolution V. We would expect this 2 5 ~ 1 design to provide 
excellent information concerning the main effects and two-factor interactions. 

Table 8-6 (on page 312) contains the effect estimates, sums of squares, and model 
regression coefficients for the 15 effects from this experiment. Figure 8-6 (on page 313) 
presents a normal probability plot of the effect estimates from this experiment. The main 
effects of A, B, and C and the AB interaction are large. Remember that, because of 
aliasing, these effects are really A + BCDE, B + ACDE, C + ABDE, and AB + CDE. 
However, because it seems plausible that three-factor and higher interactions are negli¬ 
gible, we feel safe in concluding that only A, B ,C, and AB are important effects. 

Table 8-7 (page 313) summarizes the analysis of variance for this experiment. The 
model sum of squares is SS Mode i = SS A + SS B + SS C + SS AB = 5747.25, and this accounts 
for over 99 percent of the total variability in yield. Figure 8-7 (paged 313) presents a 


Table 8-5 A 2 5_1 Design for Example 8-2 
Basic Design 

Run ABCD 


2 

3 

4 

5 

6 

7 

8 
9 

10 

11 

12 

13 

14 

15 

16 


Treatment 

E = ABCD Combination 


b 


+ bee 

d 

+ ade 

+ bde 

abd 

+ ede 

bed 

+ abede 


Yield 


9 

34 

52 

16 

22 

45 

60 

6 

10 

30 

50 

15 

21 

44 

63 






Figure 8-5 The 2v ' design for Example 8-2. 


Table 8-6 Effects, Regression Coefficients, and Sums of Squares for Example 8-2 


Variable 

Name 

-1 Level 

+ 1 Level 

A 

Aperture 

-1.000 

1.000 

B 

Develop time 

-1.000 

1.000 

C 

Exposure time 

-1.000 

1.000 

D 

Mask dimension 

-1.000 

1.000 

E 

Etch time 

-1.000 

1.000 

Variable 

Regression Coefficient 

Estimated Effect 

Sum of Squares 

Overall Average 

30.3125 



A 

5.5625 

11.1250 

495.062 

B 

16.9375 

33.8750 

4590.062 

C 

5.4375 

10.8750 

473.062 

D 

-0.4375 

-0.8750 

3.063 

E 

0.3125 

0.6250 

1.563 

AB 

3.4375 

6.8750 

189.063 

AC 

0.1875 

0.3750 

0.563 

AD 

0.5625 

1.1250 

5.063 

AE 

0.5625 

1.1250 

5.063 

BC 

0.3125 

0.6250 

1.563 

BD 

-0.0625 

-0.1250 

0.063 

BE 

-0.0625 

-0.1250 

0.063 

CD 

0.4375 

0.8750 

3.063 

CE 

0.1875 

0.3750 

0.563 

DE 

-0.6875 

-1.3750 

7.563 


312 
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10 20 30 40 50 60 

Predicted yield 

Figure 8-8 Plot of residuals versus predicted yield for Example 8-2. 


normal probability plot of the residuals, and Figure 8-8 is a plot of the residuals versus 
the predicted values. Both plots are satisfactory. 

The three factors A, B, and C have large positive effects. The AB or aperture exposure 
time interaction is plotted in Figure 8-9. TTiis plot confirms that the yields are higher 
when both A and B are at the high level. 



Low High 


Figure 8-9 Aperture-exposure time interaction for Example 8-2. 
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44.5 61.5 



Figure 8-10 Projection of the 2(r 1 design in Ex¬ 
ample 8-2 into two replicates of a 2' design in the 
factors A, B, and C. 


The 2 5-1 design will collapse into two replicates of a 2 3 design in any three of the 
original five factors. (Looking at Figure 8-5 will help you visualize this.) Figure 8-10 is 
a cube plot in the factors A, B, and C with the average yields superimposed on the eight 
comers. It is clear from inspection of the cube plot that highest yields are achieved with 
A, B, and C all at the high level. Factors D and E have little effect on average process 
yield and may be set to values that optimize other objectives (such as cost). 


Sequences of Fractional Factorials 

Using fractional factorial designs often leads to great economy and efficiency in exper¬ 
imentation, particularly if the runs can be made sequentially. For example, suppose that 
we are investigating k = 4 factors (2 4 = 16 runs). It is almost always preferable to ran 
a 2 4 v 1 fractional design (eight runs), analyze the results, and then decide on the best set 
of runs to perform next. If it is necessary to resolve ambiguities, we can always mn the 
alternate fraction and complete the 2 4 design. When this method is used to complete the 
design, both one-half fractions represent blocks of the complete design with the highest- 
order interaction confounded with blocks (here ABCD would be confounded). Thus, 
sequential experimentation has the result of losing information only on the highest-order 
interaction. Its advantage is that in many cases we learn enough from the one-half fraction 
to proceed to the next stage of experimentation, which might involve adding or removing 
factors, changing responses, or varying some of the factors over new ranges. Some of 
these possibilities are illustrated graphically in Figure 8-11 on the following page. 


EXAMPLE 8-3 . 

Reconsider the experiment in Example 8-2. We have used a 2™ 1 design and tentatively 
identified three large main effects— A, C, and D. There are two large effects associated 
with two-factor interactions, AC + BD and AD + BC. In Example 8-2, we used the fact 
that the main effect of B was negligible to tentatively conclude that the important inter- 
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Figure 8-11 Possibilities for follow-up experimentation after a fractional factorial experiment 
(adapted from Box (1992-1993), with permission of the publisher). 


actions were AC and AD. Sometimes the experimenter will have process knowledge that 
can assist in discriminating between interactions likely to be important. However, we 
can always isolate the significant interaction by running the alternate fraction, given by 
I — —ABCD. It is straightforward to show that the design and the responses are as 
follows: 


Run 

Basic Design 

A B 

C 

D = -ABC 

Treatment 

Combination 

Filtration 

Rate 

1 

- 

- 

+ 

d 

43 

2 

+ 

- 

- 

a 

71 

3 

- + 

- 

- 

b 

48 

4 

+ + 

- 

+ 

abd 

104 

5 

- 

+ 

- 

c 

68 

6 

+ - 

+ 

+ 

acd 

86 

7 

- + 

+ 

+ 

bed 

70 

8 

+ + 

+ 

- 

abc 

65 






8-3 THE ONE-QUARTER FRACTION OF THE 2 k DESIGN 


317 


The linear combinations of observations obtained from this alternate fraction are 
V A = 24.25 -*• A - BCD 

V B = 4.75 —» B — ACD 

t' c = 5.75 C ~ ABD 

€1, = 12.75 -» D — ABC 

Z' AB = 1.25 —> AB — CD 

V AC = -17.75 -> AC - BD 
V AD = 14.25 AD - BC 

These estimates may be combined with those obtained from the original one-half fraction 
to yield the following estimates of the effects: 


; 

From §(€,■ + €,') 

From 5(f, - €') 

A 

21.63 -* A 

-2.63 BCD 

B 

3.13 —> B 

—1.63 —» ACD 

C 

9.88 -> C 

4.13 ^ ABD 

D 

14.63 -* D 

1.88 ABC 

AB 

0.13 —> AB 

— 1.13 -> CD 

AC 

-18.13 —>AC 

-0.38 -» BD 

AD 

16.63 AD 

2.38 -> BC 


These estimates agree exactly with those from the original analysis of the data as a single 
replicate of a 2 4 factorial design, as reported in Example 6-2. Clearly, it is the AC and 
AD interactions that are large. 


Adding the alternate fraction to the principal fraction may be thought of as a type 
of confirmation experiment in that it provides information that will allow us to 
strengthen our initial conclusions about the two-factor interaction effects. We will in¬ 
vestigate some other aspects of combining fractional factorials to isolate interactions in 
Section 8-5. Sometimes a confirmation experiment is not this elaborate. For example, 
one could use the model equation to predict the response at a point of interest within the 
design space (not one of the points in the current design), then actually run that trial 
(perhaps several times) and use the comparison between predicted and observed response 
to confirm the results. 


8-3 THE ONE-QUARTER FRACTION OF THE 2 k DESIGN 

For a moderately large number of factors, smaller fractions of the 2 k design are frequently 
useful. Consider a one-quarter fraction of the 2 k design. This design contains 2 k ~ 2 runs 
and is usually called a 2 k ~ 2 fractional factorial. 

The 2 A ~ 2 design may be constructed by first writing down a basic design consisting 
of the runs associated with a full factorial in k — 2 factors and then associating the two 
additional columns with appropriately chosen interactions involving the first k — 2 fac¬ 
tors. Thus, a one-quarter fraction of the 2 k design has two generators. If P and Q represent 
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the generators chosen, then I = P and I = Q are called the generating relations for the 
design. The signs of P and Q (either + or —) determine which one of the one-quarter 
fractions is produced. All four fractions associated with the choice of generators ±P 
and ±Q are members of the same family. The fraction for which both P and Q are 
positive is the principal fraction. 

The complete defining relation for the design consists of all the columns that are 
equal to the identity column 7. These will consist of P, Q, and their generalized inter¬ 
action PQ\ that is, the defining relation is / = P = Q = PQ. We call the elements P, 
Q, and PQ in the defining relation words. The aliases of any effect are produced by the 
multiplication of the column for that effect by each word in the defining relation. Clearly, 
each effect has three aliases. The experimenter should be careful in choosing the gen¬ 
erators so that potentially important effects are not aliased with each other. 

As an example, consider the 2 6-2 design. Suppose we choose 7 = ABCE and 7 = 
BCDF as the design generators. Now the generalized interaction of the generators ABCE 
and BCDF is ADEF ; therefore, the complete defining relation for this design is 
7 = ABCE = BCDF = ADEF 

Consequently, this is a resolution IV design. To find the aliases of any effect (e.g., A), 
multiply that effect by each word in the defining relation. For A, this produces 

A = BCE = ABCDF = DEF 

It is easy to verify that every main effect is aliased by three- and five-factor interactions, 
whereas two-factor interactions are aliased with each other and with higher-order inter¬ 
actions. Thus, when we estimate A, for example, we are really estimating A + BCE + 
DEF + ABCDF. The complete alias structure of this design is shown in Table 8-8. If 
three-factor and higher interactions are negligible, this design gives clear estimates of 
the main effects. 

To construct the design, first write down the basic design, which consists of the 16 
runs for a full 2 6-2 = 2 4 design in A, B, C, and D. Then the two factors E and F are 
added by associating their plus and minus levels with the plus and minus signs of the 
interactions ABC and BCD, respectively. This procedure is shown in Table 8-9. 

Another way to construct this design is to derive the four blocks of the 2 6 design 
with ABCE and BCDF confounded and then choose the block with treatment combina¬ 
tions that are positive on ABCE and BCDF. This would be a 2 6-2 fractional factorial 
with generating relations 7 = ABCE and 7 = BCDF, and because both generators ABCE 
and BCDF are positive, this is the principal fraction. 

There are, of course, three alternate fractions of this particular 2fv 2 design. They 


Table 8-8 Alias Structure for the 2f v 2 Design with I = ABCE = 
BCDF = ADEF 


A = BCE = DEF = ABCDF 
B = ACE = CDF = ABDEF 
C = ABE = BDF = ACDEF 
D = BCF = AEF = ABCDE 
E = ABC = ADF = BCDEF 
F = BCD = ADE = ABCEF 


AB = CE = ACDF = BDEF 
AC = BE = ABDF = CDEF 
AD = EF = BCDE = ABCF 
AE = BC = DF = ABCDEF 
AF = DE = BCEF = ABCD 
BD = CF = ACDE = ABEF 
BF = CD = ACEF = ABDF 


ACD = BDE = ABF = CEF 
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Table 8-9 

Construction of the 2% 

2 Design with the Generators I 

= ABCE and I = 

BCDF 

Run 

A 

Basic Design 

B C 

D 

E = ABC 

F = BCD 

2 

+ 

~ 

~ 

~ 

+ 

~ 

3 

- 

+ 

- 

- 

+ 

+ 

4 

+ 

+ 

- 

- 

- 

+ 

5 

- 

- 

+ 

- 

+ 

+ 

6 

+ 

- 

+ 

- 

- 

+ 

7 

- 

+ 

+ 

- 

- 

- 

8 

+ 

+ 

+ 

- 

+ 

- 

9 

- 

- 

- 

+ 

- 

+ 

10 

+ 

- 

- 

+ 

+ 

+ 

11 

- 

+ 

- 

+ 

+ 

- 

12 

+ 

+ 

- 

+ 

- 

- 

13 

- 

- 

+ 

+ 

+ 

- 

14 

+ 

- 

+ 

+ 

- 

- 

15 

- 

+ 

+ 

+ 

- 

+ 

16 

+ 

+ 

+ 

+ 

+ 

+ 


are the fractions with generating relationships / = ABCE and / = -BCDF', I = -ABCE 
and I = BCDF\ and I = -ABCE and / = -BCDF. These fractions may be easily 
constructed by the method shown in Table 8-9. For example, if we wish to find the 
fraction for which 1 - ABCE and I = -BCDF, then in the last column of Table 8-9 we 
set F = -BCD, and the column of levels for factor F becomes 
+ +--M-1—I—1—1- 

The complete defining relation for this alternate fraction is / = ABCE = -BCDF = 
-ADEF. Certain signs in the alias structure in Table 8-9 are now changed; for instance, 
the aliases of A are A = BCE = —DEF = -ABCDF. Thus, the linear combination of 
the observations t A actually estimates A + BCE — DEF — ABCDF. 

Finally, note that the 2p7 2 fractional factorial will project into a single replicate of a 
2 4 design in any subset of four factors that is not a word in the defining relation. It also 
collapses to a replicated one-half fraction of a 2 4 in any subset of four factors that is a 
word in the defining relation. Thus, the design in Table 8-9 becomes two replicates of a 
2 4_1 in the factors ABCE, BCDF, and ADEF, because these are the words in the defining 
relation. There are 12 other combinations of the six factors, such as ABCD, ABCF, and 
so on, for which the design projects to a single replicate of the 2 4 . This design also 
collapses to two replicates of a 2 3 in any subset of three of the six factors or four replicates 
of a 2 2 in any subset of two factors. 

In general, any 2 k ~ 2 fractional factorial design can be collapsed into either a full 
factorial or a fractional factorial in some subset of r k - 2 of the original factors. 
Those subsets of variables that form full factorials are not words in the complete defining 
relation. 

EXAMPLE 8-4 . 

Parts manufactured in an injection molding process are showing excessive shrinkage. 
This is causing problems in assembly operations downstream from the injection molding 
area. A quality improvement team has decided to use a designed experiment to study the 
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injection molding process so that shrinkage can be reduced. The team decides to inves¬ 
tigate six factors—mold temperature (A), screw speed ( B ), holding time (C), cycle time 
(D), gate size (E), and holding pressure (F )—each at two levels, with the objective of 
learning how each factor affects shrinkage and also, something about how the factors 
interact. 

The team decides to use the 16-run two-level fractional factorial design in Table 8-9. 
The design is shown again in Table 8-10, along with the observed shrinkage (X 10) for 
the test part produced at each of the 16 runs in the design. Table 8-11 (on the next page) 
shows the effect estimates, sums of squares, and the regression coefficients for this 
experiment. 

A normal probability plot of the effect estimates from this experiment is shown in 
Figure 8-12 (on the facing page). The only large effects are A (mold temperature), B 
(screw speed), and the AB interaction. In light of the alias relationships in Table 8-8, it 
seems reasonable to adopt these conclusions tentatively. The plot of the AB interaction 
in Figure 8-13 (on page 322) shows that the process is very insensitive to temperature if 
the screw speed is at the low level but very sensitive to temperature if the screw speed 
is at the high level. With the screw speed at the low level, the process should produce 
an average shrinkage of around 10 percent regardless of the temperature level chosen. 

Based on this initial analysis, the team decides to set both the mold temperature and 
the screw speed at the low level. This set of conditions will reduce the mean shrinkage 
of parts to around 10 percent. However, the variability in shrinkage from part to part is 
still a potential problem. In effect, the mean shrinkage can be adequately reduced by the 
above modifications; however, the part-to-part variability in shrinkage over a production 
run could still cause problems in assembly. One way to address this issue is to see if any 
of the process factors affect the variability in parts shrinkage. 

Figure 8-14 (on page 322) presents the normal probability plot of the residuals. This 
plot appears satisfactory. The plots of residuals versus each factor were then constructed. 


Table 8-10 A 2f v 2 Design for the Injection Molding Experiment in Example 8-4 


Basic Design 

Run A B C D E = ABC F = BCD 


2 + 
3 


6 + 



10 + 
11 


12 + 

13 

14 + 

15 

16 + 


Observed 
Shrinkage 
(X 10) 

6 

10 

32 

60 

4 

15 
26 
60 

8 

12 

34 

60 

16 

5 

37 

52 
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Table 8-11 Effects, Sums of Squares, and Regression Coefficients for Example 8-4 


Variable 

Name 

-1 Level 

+ 1 Level 


A 

mold_temp 

-1.000 

1.000 


B 

screw_spd 

-1.000 

1.000 


C 

hold_time 

-1.000 

1.000 


D 

cycle_time 

-1.000 

1.000 


E 

gate—size 

-1.000 

1.000 


F 

hold_press 

-1.000 

1.000 


Variable 0 

Regression Coefficient 

Estimated Effect 

Sum of Squares 

Overall Average 

27.3125 




A 

6.9375 

13.8750 


770.062 

B 

17.8125 

35.6250 


5076.562 

C 

-0.4375 

-0.8750 


3.063 

D 

0.6875 

1.3750 


7.563 

E 

0.1875 

0.3750 


0.563 

F 

0.1875 

0.3750 


0.563 

AB + CE 

5.9375 

11.8750 


564.063 

AC + BE 

-0.8125 

-1.6250 


10.562 

AD+ EF 

-2.6875 

-5.3750 


115.562 

AE + BC + DF 

-0.9375 

-1.8750 


14.063 

AF + DE 

0.3125 

0.6250 


1.563 

BD + CF 

-0.0625 

-0.1250 


0.063 

BF + CD 

-0.0625 

-0.1250 


0.063 

ABD 

0.0625 

0.1250 


0.063 

ABF 

-2.4375 

-4.8750 


95.063 


“ Only main effects and two-factor interactions. 





322 


CHAPTER 8 TWO-LEVEL FRACTIONAL FACTORIAL DESIGNS 



Low High 


Mold temperature, A 

Figure 8-13 Plot of AB (mold temperature-screw speed) interaction 
for Example 8-4. 


One of these plots, that for residuals versus factor C (holding time), is shown in Figure 
8-15. The plot reveals that there is much less scatter in the residuals at the low holding 
time than at the high holding time. These residuals were obtained in the usual way from 
a model for predicted shrinkage: 

$ = Po + + At*2 + PllXlX 2 

= 27.3125 + 6.9375*! + 17.8125jc 2 + 5.9315 Xl x 2 
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Figure 8-15 Residuals versus holding time (C) for Example 8-4. 


where x u x 2 , and x,x 2 are coded variables that correspond to the factors A and B and the 
AB interaction. The residuals are then 


e = y - 


y 


The regression model used to produce the residuals essentially removes the location 
effects of A, B, and AB from the data; the residuals therefore contain information about 
unexplained variability. Figure 8-15 indicates that there is a pattern in the variability and 
that the variability in the shrinkage of parts may be smaller when the holding time is at 
the low level (please recall that we observed in Chapter 6 that residuals only convey 
information about dispersion effects when the location or mean model is correct). 

This is further amplified by the analysis of residuals shown in Table 8-12 (on page 
324). In this table, the residuals are arranged at the low (—) and high (+) levels of each 
factor, and the standard deviation of the residuals at the low and high levels of each 
factor have been calculated. Note that the standard deviation of the residuals with C at 
the low level [S(C~) = 1.63] is considerably smaller than the standard deviation of the 
residuals with C at the high level [S(C + ) = 5.70]. 

The bottom line of Table 8-12 presents the statistic 


sV) 

“ s\n 


Recall that if the variances of the residuals at the high (+) and low (—) levels of factor 
i are equal, then this ratio is approximately normally distributed with mean zero, and it 
can be used to judge the difference in the response variability at the two levels of factor 
i. Because the ratio F* is relatively large, we would conclude that the apparent dispersion 
or variability effect observed in Figure 8-15 is real. Thus, setting the holding time at its 
low level would contribute to reducing the variability in shrinkage from part to part 
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F] 


Figure 8-16 Normal probability plot of the dispersion effects F* for Example 8-4. 

during a production run. Figure 8-16 presents a normal probability plot of the F* values 
in Table 8-12; this also indicates that factor C has a large dispersion effect. 

Figure 8-17 shows the data from this experiment projected onto a cube in the factors 
A, B, and C. The average observed shrinkage and the range of observed shrinkage are 
shown at each comer of the cube. From inspection of this figure, we see that running the 
process with the screw speed (B) at the low level is the key to reducing average parts 
shrinkage. If B is low, virtually any combination of temperature (A) and holding time 
(C) will result in low values of average parts shrinkage. However, from examining the 
ranges of the shrinkage values at each comer of the cube, it is immediately clear that 
setting the holding time (C) at the low level is the only reasonable choice if we wish to 
keep the part-to-part variability in shrinkage low during a production run. 



A, mold temperature 


Figure 8-17 Average shrinkage and range of shrinkage in factors 
A, B, and C for Example 8-4. 






n 
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8-4 THE GENERAL 2 k p FRACTIONAL FACTORIAL DESIGN 

A 2 k fractional factorial design containing 2 k p runs is called a l/2 /J fraction of the 2 k 
design or, more simply, a 2 k ~ p fractional factorial design. These designs require the 
selection of p independent generators. The defining relation for the design consists of 
the p generators initially chosen and their 2 P - p - 1 generalized interactions. In this 
section we discuss the construction and analysis of these designs. 

The alias structure may be found by multiplying each effect column by the defining 
relation. Care should be exercised in choosing the generators so that effects of potential 
interest are not aliased with each other. Each effect has 2 P — 1 aliases. For moderately 
large values of k, we usually assume higher-order interactions (say, third- or fourth-order 
and higher) to be negligible, and this greatly simplifies the alias structure. 

It is important to select the p generators for a 2 k ~ p fractional factorial design in such 
a way that we obtain the best possible alias relationships. A reasonable criterion is to 
select the generators such that the resulting 2 k ~ p design has the highest possible reso¬ 
lution. To illustrate, consider the 2fv 2 design in Table 8-9, where we used the generators 
E = ABC and F = BCD, thereby producing a design of resolution IV. This is the 
maximum resolution design. If we had selected E = ABC and F = ABCD, the complete 
defining relation would have been I = ABCE = ABCDF = DEF, and the design would 
be of resolution III. Clearly this is an inferior choice because it needlessly sacrifices 
information about interactions. 

Sometimes resolution alone is insufficient to distinguish between designs. For ex¬ 
ample, consider the three 2jy 2 designs in Table 8-13. All of these designs are of resolution 
IV, but they have rather different alias structures (we have assumed that three-factor and 
higher interactions are negligible) with respect to the two-factor interactions. Clearly 
design A has more aliasing and design C the least, so design C would be the best choice 
for a 2iv 2 . 

The three word lengths in design A are all 4; that is, the word length pattern is {4, 
4,4}. For design B it is {4,4,6), and for design C it is {4,5,5}. Notice that the defining 
relation for design C has only one four-letter word, whereas the other designs have two 
or three. Thus, design C minimizes the number of words in the defining relation that are 
of minimum length. We call such a design a minimum aberration design. Minimizing 
aberration in a design of resolution R ensures that the design has the minimum number 
of main effects aliased with interactions of order R - 1, the minimum number of two- 


Table 8-13 Three Choices of Generators for the 2/ v 2 Design 


Design A Generators: 

F = ABC, G = BCD 

I = ABCF = BCDG = ADFG 

Design B Generators: 

F = ABC, G = ADE 

I = ABCF = ADEG = BCDEFG 

Design C Generators: 

F = ABCD, G = ABDE 

I = ABCDF = ABDEG = CEFG 

Aliases (Two-Factor Interactions) 
AB = CF 

AC = BF 

AD = FG 

AG = DF 

BD = CG 

BG = CD 

AF = BC = DG 

Aliases (Two-Factor Interactions) 
AB = CF 

AC = BF 

AD = EG 

AE = DG 

AF = BC 

AG = DE 

Aliases (Two-Factor Interactions) 
CE = FG 

CF = EG 

CG = EF 
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factor interactions aliased with interactions of order R — 2, and so forth. Refer to Fries 
and Hunter (1980) for more details. 

Table 8-14 (pages 328-330) presents a selection of 2 k ~ p fractional factorial designs 
for k *£ 15 factors and up to n =£ 128 runs. The suggested generators in this table will 
result in a design of the highest possible resolution. These are also the minimum aber¬ 
ration designs. 

The alias relationships for all of the designs in Table 8-14 for which n =£ 64 are 
given in Appendix Table Xll(a-w). The alias relationships presented in this table focus 
on main effects and two- and three-factor interactions. The complete defining relation is 
given for each design. This appendix table makes it very easy to select a design of 
sufficient resolution to ensure that any interactions of potential interest can be estimated. 

EXAMPLE 8-5 . 

To illustrate the use of Table 8-14, suppose that we have seven factors and that we are 
interested in estimating the seven main effects and getting some insight regarding the 
two-factor interactions. We are willing to assume that three-factor and higher interactions 
are negligible. This information suggests that a resolution IV design would be 
appropriate. 

Table 8-14 shows that there are two resolution IV fractions available: the 2Jv 2 with 
32 runs and the 2jy 3 with 16 runs. Appendix Table XII contains the complete alias 
relationships for these two designs. The aliases for the 2jy 3 16-run design are in Appendix 
Table XII(i). Notice that all seven main effects are aliased with three-factor interactions. 
The two-factor interactions are all aliased in groups of three. Therefore, this design will 
satisfy our objectives; that is, it will allow the estimation of the main effects, and it will 
give some insight regarding two-factor interactions. It is not necessary to run the 2jy 2 
design, which would require 32 runs. Appendix Table XII(j) shows that this design would 
allow the estimation of all seven main effects and that 15 of the 21 two-factor interactions 
could also be uniquely estimated. (Recall that three-factor and higher interactions are 
negligible.) This is more information about interactions than is necessary. The complete 
design is shown in Table 8-15 (page 330). Notice that it was constructed by starting with 
the 16-run 2 4 design in A, B, C, and D as the basic design and then adding the three 
columns E = ABC, F = BCD, and G = ACD. The generators are / = ABCE, I = BCDF, 
and I = ACDG (Table 8-14). The complete defining relation is / = ABCE = BCDF = 
ADEF = ACDG = BDEG = CEFG = ABFG. 


Analysis of 2 k ~ p Fractional Factorials 

There are many computer programs that can be used to analyze the 2 k p fractional fac¬ 
torial design. For example, the Design-Expert program illustrated in Chapter 6 has this 
capability. 

The design may also be analyzed by resorting to first principles; the z'th effect is 
estimated by 

_ 2(Contrast,) _ Contrast, 

! “ N “ (A/2) 

where the Contrast, is found using the plus and minus signs in column i and N = 2 k ~ p 
is the total number of observations. The 2 k ~ p design allows only 2 k ~ p — 1 effects (and 
their aliases) to be estimated. 
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Table 8-14 Selected 2 k Fractional Factorial Designs 

Number of Number of Design 

Factors, k Fraction Runs Generators 

3 4 C = ±AB 

4 8 D = ±ABC 

5 2y _1 16 £ = ±ABCD 

2tn 2 8 Z> = ±AB 

E = ±AC 

6 2 V 32 F = ±ABCDE 

2w 2 16 £ = ±ABC 

F = ±BCD 

2nT 3 8 £> = ±AB 

£ = ±AC 
£ = ±£C 

7 2vu‘ 64 G = ±ABCDEF 

2r7 2 32 £ = ±ABCD 

G = ±ABDE 

2?v 3 16 £ = ±ABC 

F = ±BCD 
G = ±ACD 

2lH A 8 D = ±AB 

£ = ±AC 
£ = ±SC 
G = ±ABC 

8 2|T 2 64 G = ±ABCD 

H = ±ABEF 

2fvT 3 32 £ = ±ABC 

G = ±ABD 
H = ±BCDE 

2fv 4 16 £ = ±BCD 

F = ±ACD 
G = ±ABC 
H = ±ABD 

9 2vT 2 128 17 = ±ACDFG 

J= ±BCEFG 

2™ 3 64 G = ±ABCD 

H = ±ACEF 
J = ±CDEF 

2?C 4 32 £ = ±BCDE 

G= ±ACDE 
H= ±ABDE 
J= ±ABCE 

2£r 5 16 £ = ±ABC 

F = ±BCD 
G = ±ACD 
H = ±ABD 
J= ±ABCD 

10 2v 3 128 // = ±ABCG 

J= ±ACDE 
K= ±ACDF 
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Table 8-14 (continued) 
Number of 

Factors, k Fraction 

2jv -4 


11 2 H~ : 


12 2m -8 


13 2i 3 r 9 


Number of Design 

Runs Generators 


64 G = ±BCDF 

H = ±ACDF 
J = ±ABDE 
K= ±ABCE 
32 F = ±ABCD 

G = ±ABCE 
H= ±ABDE 
J = ±ACDE 
K = ±BCDE 
16 E = ±ABC 

F = +BCD 
G = ±ACD 
H= ±ABD 
J = ±ABCD 
K = ±AB 

64 G - ±CDE 

H = ±ABCD 
/= ±ABF 
K = ±BDEF 
L = ±ADEF 
32 F = ±ABC 

G = ±BCD 
H = ±CDE 
J = ±ACD 
K= ±ADE 
L = ±BDE 
16 E = ±ABC 

F = ±BCD 
G = ±ACD 
H = ±ABD 
J = ±ABCD 
K = ±AS 
L = ±AC 

16 £ = ±ABC 

F = ±ABD 
G = ±ACD 
H = ±BCD 
J= ±ABCD 
K= ±AB 
L = ±AC 
M = ±AD 

16 E = ±ABC 

F = ±ABD 
G = ±ACD 
H = ±BCD 
J= ±ABCD 
K= ±AB 
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Table 8-14 (continued) 
Number of 

Factors, k Fraction 

2jv -4 


11 2 H~ : 


12 2m -8 


13 2i 3 r 9 


Number of Design 

Runs Generators 


64 G = ±BCDF 

H = ±ACDF 
J = ±ABDE 
K= ±ABCE 
32 F = ±ABCD 

G = ±ABCE 
H= ±ABDE 
J = ±ACDE 
K = ±BCDE 
16 E = ±ABC 

F = +BCD 
G = ±ACD 
H= ±ABD 
J = ±ABCD 
K = ±AB 

64 G - ±CDE 

H = ±ABCD 
/= ±ABF 
K = ±BDEF 
L = ±ADEF 
32 F = ±ABC 

G = ±BCD 
H = ±CDE 
J = ±ACD 
K= ±ADE 
L = ±BDE 
16 E = ±ABC 

F = ±BCD 
G = ±ACD 
H = ±ABD 
J = ±ABCD 
K = ±AS 
L = ±AC 

16 £ = ±ABC 

F = ±ABD 
G = ±ACD 
H = ±BCD 
J= ±ABCD 
K= ±AB 
L = ±AC 
M = ±AD 

16 E = ±ABC 

F = ±ABD 
G = ±ACD 
H = ±BCD 
J= ±ABCD 
K= ±AB 
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Table 8-14 (continued) 

Number of Number of Design 

Factors, k Fraction Runs Generators 

L = -AC 
M = ±AD 
N= ±BC 

14 2^r 10 16 E = ±ABC 

F = ±ABD 
G = ±ACD 
H = ±BCD 
J= ±ABCD 
K = ±AB 
L = ±AC 
M = ±AD 
N = ±BC 
O = ±BD 

15 2]t n 16 E = ±ABC 

F = ±ABD 
G = ±ACD 
H = ±BCD 
J = ±ABCD 
K = ±AB 
L = ±AC 
M = ±AD 
N = ±BC 
O = ±BD 
P = ±CD 


Table 8-15 A 2? v 3 Fractional Factorial Design 


Run 

A 

Basi 

B 

c Design 

C 

D 

E = ABC 

F = BCD 

G = ACD 

2 

+ 

~ 

~ 

- 

+ 

: 

+ 

3 

- 

+ 

- 

- 

+ 

+ 

- 

4 

+ 

+ 

- 

- 

- 

+ 

+ 

5 

- 

- 

+ 

- 

+ 

+ 

+ 

6 

+ 

- 

+ 

- 

- 

+ 

- 

7 

- 

+ 

+ 

- 

- 

- 

+ 

8 

+ 

+ 

+ 

- 

+ 

- 

- 

9 

- 

- 

- 

+ 

- 

+ 

+ 

10 

+ 

- 

- 

+ 

+ 

+ 

- 

11 

- 

+ 

- 

+ 

+ 

- 

+ 

12 

+ 

+ 

- 

+ 

- 

- 

- 

13 

- 

- 

+ 

+ 

+ 

- 

- 

14 

+ 

- 

+ 

+ 

- 

- 

+ 

15 

- 

+ 

+ 

+ 

- 

+ 

- 

16 

+ 

+ 

+ 

+ 

+ 

+ 

+ 
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Projection of the 2 k ~ p Fractional Factorial 

The 2 k ~ p design collapses into either a full factorial or a fractional factorial in any subset 
of r k - p of the original factors. Those subsets of factors providing fractional factorials 
are subsets appearing as words in the complete defining relation. This is particularly 
useful in screening experiments when we suspect at the outset of the experiment that 
most of the original factors will have small effects. The original 2 k ~ p fractional factorial 
can then be projected into a full factorial, say, in the most interesting factors. Conclusions 
drawn from designs of this type should be considered tentative and subject to further 
analysis. It is usually possible to find alternative explanations of the data involving 
higher-order interactions. 

As an example, consider the 2/ v 3 design from Example 8-5. This is a 16-run design 
involving seven factors. It will project into a full factorial in any four of the original 
seven factors that is not a word in the defining relation. There are 35 subsets of four 
factors, seven of which appear in the complete defining relation (see Table 8-15). Thus, 
there are 28 subsets of four factors that would form 2 4 designs. One combination that is 
obvious upon inspecting Table 8-15 is A, B, C, and D. 

To illustrate the usefulness of this projection properly, suppose that we are con¬ 
ducting an experiment to improve the efficiency of a ball mill and the seven factors are 
as follows: 

1. Motor speed 

2. Gain 

3. Feed mode 

4. Feed sizing 

5. Material type 

6. Screen angle 

7. Screen vibration level 

We are fairly certain that motor speed, feed mode, feed sizing, and material type will 
affect efficiency and that these factors may interact. The role of the other three factors 
is less well-known, but it is likely that they are negligible. A reasonable strategy would 
be to assign motor speed, feed mode, feed sizing, and material type to columns A, B, C, 
and D, respectively, in Table 8-15. Gain, screen angle, and screen vibration level would 
be assigned to columns E, F, and G, respectively. If we are correct and the “minor 
variables” E, F, and G are negligible, we will be left with a full 2 4 design in the key 
process variables. 

Blocking Fractional Factorials 

Occasionally, a fractional factorial design requires so many runs that all of them cannot 
be made under homogeneous conditions. In these situations, fractional factorials may be 
confounded in blocks. Appendix Table XII contains recommended blocking arrange¬ 
ments for many of the fractional factorial designs in Table 8-14. The minimum block 
size for these designs is eight runs. 

To illustrate the general procedure, consider the 2fv 2 fractional factorial design with 
the defining relation I = ABCE = BCDF = ADEF shown in Table 8-10. This fractional 
design contains 16 treatment combinations. Suppose we wish to run the design in two 
blocks of eight treatment combinations each. In selecting an interaction to confound with 
blocks, we note from examining the alias structure in Appendix Table XII(f) that there 
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Block 1 Block 2 


Figure 8-18 The 2?v 2 design in 
two blocks with ABD 
confounded. 


are two alias sets involving only three-factor interactions. The table suggests selecting 
ABD (and its aliases) to be confounded with blocks. This would give the two blocks 
shown in Figure 8-18. Notice that the principal block contains those treatment combi¬ 
nations that have an even number of letters in common with ABD. These are also the 
treatment combinations for which L = x x + x 2 + x 4 = 0 (mod 2). 

EXAMPLE 8-6 . 

A five-axis CNC machine is used to machine an impeller used in a jet turbine engine. 
The blade profiles are an important quality characteristic. Specifically, the deviation of 
the blade profile from the profile specified on the engineering drawing is of interest. An 
experiment is run to determine which machine parameters affect profile deviation. The 
eight factors selected for the design are as follows: 


(i) 

abf 

cef 

bde 

acd 

bcdf 


bef 

be 

df 

abd 


Factor 

Low Level (-) 

High Level (+) 

A = x-Axis shift (0.001 in) 

0 

15 

B = y-Axis shift (0.001 in) 

0 

15 

C = z-Axis shift (0.001 in) 

0 

15 

D = Tool vendor 

1 

2 

E = a-Axis shift (0.001 deg) 

0 

30 

F = Spindle speed (%) 

90 

110 

G = Fixture height (0.001 in) 

0 

15 

H = Feed rate (%) 

90 

110 


One test blade on each part is selected for inspection. The profile deviation is measured 
using a coordinate measuring machine, and the standard deviation of the difference be¬ 
tween the actual profile and the specified profile is used as the response variable. 

The machine has four spindles. Because there may be differences in the spindles, 
the process engineers feel that the spindles should be treated as blocks. 

The engineers feel confident that three-factor and higher interactions are not too 
important, but they are reluctant to ignore the two-factor interactions. From Table 8-14, 
two designs initially appear appropriate: the 2fy 4 design with 16 runs and the 2^ 3 design 
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with 32 runs. Appendix Table XII(l) indicates that if the 16-run design is used, there 
will be fairly extensive aliasing of two-factor interactions. Furthermore, this design can¬ 
not be run in four blocks without confounding four two-factor interactions with blocks. 
Therefore, the experimenters decide to use the 2?v 3 design in four blocks. This confounds 
one three-factor interaction alias chain and one two-factor interaction (EH) and its three- 
factor interaction aliases with blocks. The EH interaction is the interaction between the 
a-axis shift and the feed rate, and the engineers consider an interaction between these 
two variables to be fairly unlikely. 

Table 8-16 contains the design and the resulting responses as standard deviation X 
10 3 in. Because the response variable is a standard deviation, it is often best to perform 


Table 8- 

16 

The 2 8-3 

Design i 

n Four Blocks for Example 

8-6 




Run 

A 

Basic Design 

BCD 

E F 

= ABC 

G = ABD H = 

BCDE 

Block 

Actual Run 
Order 

Standard 
Deviation 
(X 10 3 in) 

1 

- 

- 

- - 

- 

- 

- 

+ 

3 

18 

2.76 

2 

+ 

- 

- - 

- 

+ 

+ 

+ 

2 

16 

6.18 

3 

- 

+ 

- - 

- 

+ 

+ 

- 

4 

29 

2.43 

4 

+ 

+ 

- - 

- 

- 

- 

- 

1 

4 

4.01 

5 

- 

- 

+ - 

- 

+ 

- 

- 

1 

6 

2.48 

6 

+ 

- 

+ - 

- 

- 

+ 

- 

4 

26 

5.91 

7 

- 

+ 

+ - 

- 

- 

+ 

+ 

2 

14 

2.39 

8 

+ 

+ 

+ - 

- 

+ 

- 

+ 

3 

22 

3.35 

9 

- 

- 

- + 

- 

- 

+ 

- 

1 

8 

4.40 

10 

+ 

- 

- + 

- 

+ 

- 

- 

4 

32 

4.10 

11 

- 

+ 

- + 

- 

+ 

- 

+ 

2 

15 

3.22 

12 

+ 

+ 

- + 

- 

- 

+ 

+ 

3 

19 

3.78 

13 

- 

- 

+ + 

- 

+ 

+ 

+ 

3 

24 

5.32 

14 

+ 

- 

+ + 

- 

- 

- 

+ 

2 

11 

3.87 

15 

- 

+ 

+ + 

- 

- 

- 

- 

4 

27 

3.03 

16 

+ 

+ 

+ + 

- 

+ 

+ 

- 

1 

3 

2.95 

17 

- 

- 

- - 

+ 

- 

- 

- 

2 

10 

2.64 

18 

+ 

- 

- - 

+ 

+ 

+ 

- 

3 

21 

5.50 

19 

- 

+ 

- - 

+ 

+ 

+ 

+ 

1 

7 

2.24 

20 

+ 

+ 

- - 

+ 

- 

- 

+ 

4 

28 

4.28 

21 

- 

- 

+ - 

+ 

+ 

- 

+ 

4 

30 

2.57 

22 

+ 

- 

+ - 

+ 

- 

+ 

+ 

1 

2 

5.37 

23 

- 

+ 

+ 

+ 

- 

+ 

- 

3 

17 

2.11 

24 

+ 

+ 

■ - 

+ 

+ 

- 

- 

2 

13 

4.18 

25 

- 

- 

- + 

+ 

- 

+ 

+ 

4 

25 

3.96 

26 

+ 

- 

- + 

+ 

+ 

- 

+ 

1 

1 

3.27 

27 

- 

+ 

- + 

+ 

+ 

- 

- 

3 

23 

3.41 

28 

+ 

+ 

- + 

+ 

- 

+ 

- 

2 

12 

4.30 

29 

- 

- 

+ + 

+ 

+ 

+ 

- 

2 

9 

4.44 

30 

+ 

- 

+ + 

+ 

- 

- 

- 

3 

20 

3.65 

31 

- 

+ 

+ + 

+ 

- 

- 

+ 

1 

5 

4.41 

32 


+ 

+ + 

+ 

+ 

+ 

+ 

4 

31 

3.40 
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Table 8-17 Effect Estimates, Regression Coefficients, and Sums of Squares for Example 8-6 


Variable 

Name 

-1 Level 

+ 1 Level 

A 

x-Axis shift 

0 

15 


B 

y-Axis shift 

0 

15 


C 

z-Axis shift 

0 

15 


D 

Tool vendor 

1 

2 


E 

a-Axis shift 

0 

30 


F 

Spindle speed 

90 

110 


G 

Fixture height 

0 

15 


H 

Feed rate 

90 

110 


Variable 3 

Regression Coefficient 

Estimated Effect 


Sum of Squares 

Overall Average 

1.28007 




A 

0.14513 

0.29026 


0.674020 

B 

-0.10027 

-0.20054 


0.321729 

C 

-0.01288 

-0.02576 


0.005310 

D 

0.05407 

0.10813 


0.093540 

E 

—2.531E-04 

—5.063E-04 


2.050E-06 

F 

-0.01936 

-0.03871 


0.011988 

G 

0.05804 

0.11608 


0.107799 

H 

0.00708 

0.01417 


0.001606 

AB + CF + DG 

-0.00294 

-0.00588 


2.767E-04 

AC + BF 

-0.03103 

-0.06206 


0.030815 

AD+ BG 

-0.18706 

-0.37412 


1.119705 

AE 

0.00402 

0.00804 


5.170E-04 

AF + BC 

-0.02251 

-0.04502 


0.016214 

AG + BD 

0.02644 

0.05288 


0.022370 

AH 

-0.02521 

-0.05042 


0.020339 

BE 

0.04925 

0.09851 


0.077627 

BH 

0.00654 

0.01309 


0.001371 

CD +FG 

0.01726 

0.03452 


0.009535 

CE 

0.01991 

0.03982 


0.012685 

CG + DF 

-0.00733 

-0.01467 


0.001721 

CH 

0.03040 

0.06080 


0.029568 

DE 

0.00854 

0.01708 


0.002334 

DH 

0.00784 

0.01569 


0.001969 

EF 

-0.00904 

-0.01808 


0.002616 

EG 

-0.02685 

-0.05371 


0.023078 

EH 

-0.01767 

-0.03534 


0.009993 

FH 

-0.01404 

-0.02808 


0.006308 

GH 

0.00245 

0.00489 


1.914E-04 

ABE 

0.01665 

0.03331 


0.008874 

ABH 

-0.00631 

-0.01261 


0.001273 

ACD 

-0.02717 

-0.05433 


0.023617 


“ Only main effects and two-factor interactions. 
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Effect estimates 

Figure 8-19 Normal probability plot of the effect estimates for Example 8- 


the analysis following a log transformation. The effect estimates are shown in Table 
8-17. Figure 8-19 is a normal probability plot of the effect estimates, using In (standard 
deviation X 10 3 ) as the response variable. The only large effects are A = x-axis shift, 
B = y-axis shift, and the alias chain involving AD + BG. Now AD is the x-axis shift- 
tool vendor interaction, and BG is the y-axis shift-fixture height interaction, and since 
these two interactions are aliased it is impossible to separate them based on the data from 
the current experiment. Since both interactions involve one large main effect it is also 
difficult to apply any “obvious” simplifying logic to the situation either. If there is some 
engineering knowledge or process knowledge available that sheds light on the situation, 
then perhaps a choice could be made between the two interactions; otherwise, more data 
will be required to separate these two effects (the problem of adding runs to a fractional 
factorial to de-alias interactions is discussed in Section 8-5 and in the Supplemental 
Material for this chapter). 

Suppose that process knowledge suggests that the appropriate interaction is likely to 
be AD. Table 8-18 is the resulting analysis of variance for the model with factors A, B, 


Table 8-18 Analysis of Variance for Example 8-6 

Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Square 

F 0 

P-Value 

A 

0.6740 

1 

0.6740 

39.42 

<0.0001 

B 

0.3217 

1 

0.3217 

18.81 

0.0002 

D 

0.0935 

1 

0.0935 

5.47 

0.0280 

AD 

1.1197 

1 

1.1197 

65.48 

<0.0001 

Blocks 

0.0201 

3 

0.0067 



Error 

0.4099 

24 

0.0171 



Total 

2.6389 

31 
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1.247 1.273 



Figure 8-22 The 2? v 3 design in Example 8-6 projected into four 
replicates of a 2 3 design in factors A, B, and D. 


D and AD (factor D was included to preserve the hierarchy principle). Notice that the 
block effect is small, suggesting that the machine spindles are not very different. 

Figure 8-20 (on the facing page) is a normal probability plot of the residuals from 
this experiment. This plot is suggestive of slightly heavier than normal tails, so possibly 
other transformations should be considered. The AD interaction plot is in Figure 8-21 
(on the facing page). Notice that tool vendor (D) and the magnitude of the x-axis shift 
(A) have a profound impact on the variability of the blade profile from design specifi¬ 
cations. Running A at the low level (0 offset) and buying tools from vendor 1 gives the 
best results. Figure 8-22 shows the projection of this 2^7 3 design into four replicates of 
a 2 3 design in factors A, B, and D. The best combination of operating conditions is A at 
the low level (0 offset), B at the high level (0.015 in offset), and D at the low level (tool 
vendor 1). 


8-5 RESOLUTION III DESIGNS 

As indicated earlier, the sequential use of fractional factorial designs is very useful, often 
leading to great economy and efficiency in experimentation. We now illustrate these 
ideas using the class of resolution III designs. 

It is possible to construct resolution IE designs for investigating up to k = N - 1 
factors in only A runs, where A is a multiple of 4. These designs are frequently useful 
in industrial experimentation. Designs in which A is a power of 2 can be constructed by 
the methods presented earlier in this chapter, and these are presented first. Of particular 
importance are designs requiring 4 runs for up to 3 factors, 8 runs for up to 7 factors and 
16 runs for up to 15 factors. If k = A - 1, the fractional factorial design is said to be 
saturated. 

A design for analyzing up to three factors in four runs is the 2 3 ,f 1 design, presented 
in Section 8-2. Another very useful saturated fractional factorial is a design for studying 
seven factors in eight runs, that is, the 2j„ 4 design. This design is a one-sixteenth fraction 
of the 2 7 . It may be constructed by first writing down as the basic design the plus and 
minus levels for a full 2 3 design in A, B, and C and then associating the levels of four 
additional factors with the interactions of the original three as follows: D = AB, 
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Table 8-19 The 2 h 7 4 Design with the Generators I = ABD, I = ACE, I = BCF, and 1 = ABCG 


C D=AB E = AC F = BC G = ABC 


bcf 

abcdefg 


E = AC, F = BC, and G = ABC. Thus, the generators for this design are I = ABD, 
/ = ACE, I = BCF, and / = ABCG. The design is shown in Table 8-19. 

The complete defining relation for this design is obtained by multiplying the four 
generators ABD, ACE, BCF, and ABCG together two at a time, three at a time, and four 
at a time, yielding 

I = ABD = ACE = BCF = ABCG = BCDE = ACDF = CDG 
= ABEF = BEG = AFG = DEF = ADEG = CEFG = BDFG = ABCDEFG 
To find the aliases of any effect, simply multiply the effect by each word in the defining 
relation. For example, the aliases of B are 

B = AD = ABCE = CF = ACG = CDE = ABCDF = BCDG = AEF = EG 
= ABFG = BDEF = ABDEG = BCEFG = DFG = ACDEFG 
This design is a one-sixteenth fraction, and because the signs chosen for the gener¬ 
ators are positive, this is the principal fraction. It is also of resolution III because the 
smallest number of letters in any word of the defining contrast is three. Any one of the 
16 different 2 7 m 4 designs in this family could be constructed by using the generators with 
1 of the 16 possible arrangements of signs in I = ±ABD, I = ±ACE, I = ±BCF, I = 
±ABCG. 

The seven degrees of freedom in this design may be used to estimate the seven main 
effects. Each of these effects has 15 aliases; however, if we assume that three-factor and 
higher interactions are negligible, then considerable simplification in the alias structure 
results. Making this assumption, each of the linear combinations associated with the 
seven main effects in this design actually estimates the main effect and three two-factor 
interactions: 

i A A + BD + CE + FG 
€ B ^ B + AD + CF + EG 
€ c —> C + AE + BF + DG 
D + AB + CG + EF 
Z E -> E + AC + BG + DF 
£ f —>F + BC + AG + DE 
i G = G + CD + BE + AF 


( 8 - 1 ) 
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These aliases are found in Appendix Table XII(h), ignoring three-factor and higher 
interactions. 

The saturated 2n7 4 design in Table 8-19 can be used to obtain resolution III designs 
for studying fewer than seven factors in eight runs. For example, to generate a design 
for six factors in eight runs, simply drop any one column in Table 8-19, for example, 
column G. This produces the design shown in Table 8-20. 

It is easy to verify that this design is also of resolution IE; in fact, it is a 2fh~ 3 , or a 
one-eighth fraction, of the 2 6 design. The defining relation for the 2f,7 3 design is equal 
to the defining relation for the original 2] n 4 design with any words containing the letter 
G deleted. Thus, the defining relation for our new design is 

I = ABD = ACE = BCF = BCDE = ACDF = ABEF = DEF 
In general, when d factors are dropped to produce a new design, the new defining relation 
is obtained as those words in the original defining relation that do not contain any dropped 
letters. When constructing designs by this method, care should be exercised to obtain the 
best arrangement possible. If we drop columns B, D, F, and G from Table 8-19, we 
obtain a design for three factors in eight runs, yet the treatment combinations correspond 
to two replicates of a 2 3 design. The experimenter would probably prefer to run a full 2 3 
design in A, C, and E. 

It is also possible to obtain a resolution III design for studying up to 15 factors in 
16 runs. This saturated 2^~ n design can be generated by first writing down the 16 
treatment combinations associated with a 2 4 design in A, B, C, and D and then equating 
11 new factors with the two-, three-, and four-factor interactions of the original four. In 
this design, each of the 15 main effects is aliased with seven two-factor interactions. A 
similar procedure can be used for the 2m -26 design, which allows up to 31 factors to be 
studied in 32 runs. 

Sequential Assembly of Fractions to Separate Effects 

By combining fractional factorial designs in which certain signs are switched, we can 
systematically isolate effects of potential interest. This type of sequential experiment is 
called a fold over of the original design. The alias structure for any fraction with the 
signs for one or more factors reversed is obtained by making changes of sign on the 
appropriate factors in the alias structure of the original fraction. 

Consider the 277 4 design in Table 8-19. Suppose that along with this principal frac- 


Table 8-20 A 2fn 3 Design with the Generators I = ABD, 1 = ACE, and I = BCF 


def 

af 


bcf 

abcdef 
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tion a second fractional design with the signs reversed in the column for factor D is also 
run. That is, the column for D in the second fraction is 

-++-++- 

The effects that may be estimated from the first fraction are shown in Equation 8-1, and 
from the second fraction we obtain 


A - BD + CE + FG 
€£ -»• B - AD + CF + EG 
C + AE + BF - DG 
V d ^D - AB - CG - EF 

i.e., t'-D —> ~D + AB + CG + EF (8-2) 

t' E E + AC + BG - DF 
£p ^ F + BC + AG - DE 
€c G - CD + BE + AF 

assuming that three-factor and higher interactions are insignificant. Now from the two 
linear combinations of effects !(€, + €,') and $(€,• — €') we obtain 


/■ 

From j(€, + €') 

From 5(€,- - f') 

A 

A+ CE + FG 

BD 

B 

B + CF +EG 

AD 

C 

C+AE + BF 

DG 

D 

D 

AB + CG + EF 

E 

E + AC + BG 

DF 

F 

F + BC + AG 

DE 

G 

G + BE+ AF 

CD 


Thus, we have isolated the main effect of D and all of its two-factor interactions. In 
general, if we add to a fractional design of resolution III or higher a further fraction with 
the signs of a single factor reversed, then the combined design will provide estimates of 
the main effect of that factor and its two-factor interactions. 

Now suppose we add to a resolution m fractional a second fraction in which the 
signs for all the factors are reversed. This type of fold over (sometimes called a full fold 
over or a reflection) breaks the alias links between main effects and two-factor interac¬ 
tions. That is, we may use the combined design to estimate all the main effects clear of 
any two-factor interactions. The following example illustrates the technique. 

EXAMPLE 8-7 . 

A human performance analyst is conducting an experiment to study eye focus time and 
has built an apparatus in which several factors can be controlled during the test. The 
factors he initially regards as important are acuity or sharpness of vision (A), distance 
from target to eye ( B ), target shape (C), illumination level (£>), target size (£), target 
density (F), and subject (G). Two levels of each factor are considered. He suspects that 
only a few of these seven factors are of major importance and that high-order interactions 
between the factors can be neglected. On the basis of this assumption, the analyst decides 
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Table 8-21 A 2j u 4 Design for the Eye Focus Time Experiment 


Run 

Basic Design 

ABC 

D = AB 

E = AC 

F = BC 

G = ABC 


Time 

1 

- 

- 

- 

+ 

+ 

+ 

- 

def 

85.5 

2 

+ 

- 

- 

- 

- 

+ 

+ 

afg 

75.1 

3 

- 

+ 

- 

- 

+ 

- 

+ 

beg 

93.2 

4 

+ 

+ 

- 

+ 

- 

- 

- 

abd 

145.4 

5 

- 

- 

+ 

+ 

- 

- 

+ 

cdg 

83.7 

6 

+ 

- 

+ 

- 

+ 

- 

- 

ace 

77.6 

7 

- 

+ 

+ 

- 

- 

+ 

- 

bcf 

95.0 

8 

+ 

+ 

+ 

+ 

+ 

+ 

+ 

abcdefg 

141.8 


to run a screening experiment to identify the most important factors and then to concen¬ 
trate further study on those. To screen these seven factors, he runs the treatment com¬ 
binations from the 2hj 4 design in Table 8-19 in random order, obtaining the focus times 
in milliseconds, as shown in Table 8-21. 

Seven main effects and their aliases may be estimated from these data. From Equa¬ 
tion 8-1, we see that the effects and their aliases are 

i A = 20.63 -> A + BD + CE + FG 
( B = 38.38 B + AD + CF + EG 
€ c = -0.28 C + AE + BF + DG 
i D = 28.88 -*D + AB + CG + EF 
l E = -0.28 —» £ + AC + BG + DF 
i F = -0.63 -* £ + BC + AG + DE 
i G = -2.43 ^G + CD + BE + AF 

For example, 

t A = $(-85.5 + 75.1 - 93.2 + 145.4 - 83.7 + 77.6 - 95.0 + 141.8) = 20.63 

The three largest effects are t A , £ B , and i D . The simplest interpretation of the data is that 
the main effects of A, B, and D are all significant. However, this interpretation is not 
unique, because one could also logically conclude that A, B, and the AB interaction, or 
perhaps B, D, and the BD interaction, or perhaps A, D, and the AD interaction are the 
hue effects. 

Notice that ABD is a word in the defining relation for this design. Therefore, this 
2ln 4 design does not project into a full 2 3 factorial in ABD\ instead, it projects into two 
replicates of a 2 3_ 1 design, as shown in Figure 8-23 (on the next page). Because the 2 3 “ 1 
design is a resolution III design, A will be aliased with BD, B will be aliased with AD, 
and D will be aliased with AB, so the interactions cannot be separated from the main 
effects. The analyst here may have been unlucky. If he had assigned illumination level 
to C instead of D, the design would have projected into a full 2 3 design, and the inter¬ 
pretation could have been simpler. 

To separate the main effects and the two-factor interactions, a second fraction is run 
with all the signs reversed. This fold-over design is shown in Table 8-22 (on the next 
page) along with the observed responses. Notice that when we fold over a resolution III 
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Figure 8-23 The 2/„ 4 design projected into 
two replicates of a 2fu 1 design in A, B, and D. 


design in this manner, we (in effect) change the signs on the generators that have an odd 
number of letters. The effects estimated by this fraction are 

V A = -17.68 -> A — BD - CE - FG 

i' B = 37.73 B — AD - CF - EG 

Vc = -3.33 C — AE - BF - DG 

t' D = 29.88 —> D - AB - CG — EF 

i' E = 0.53 -* E — AC — BG - DF 

i’ F = 1.63 -> F - BC - AG - DE 

t’ G = 2.68 -> G — CD - BE - AF 

By combining this second fraction with the original one, we obtain the following esti¬ 
mates of the effects: 




i 

From + €') From §(€, - i\ 

?) 





A 

A = 1.48 

BD + CE + FG = 

19.15 





B 

B = 38.05 

AD + CF + EG = 

0.33 





C 

C = -1.80 

' AE + BF + DG = 

1.53 





D 

D= 29.38 

AB + CG + EF = 

-0.50 





E 

E = 0.13 

AC + BG + DF = 

-0.40 





F 

F = 0.50 

BC + AG + DE = 

-1.53 





G 

G = 0.13 

CD + BE + AF = 

-2.55 



Table 8-22 

afo: 

[d-Ove 

r 2j n 4 Design for the Eye Focus Time Experiment 



Bas 

lie Design 






Run A 

B 

C 

D = —AB h 

; = -AC F = -BC c 

? = ABC 


Time 

1 + 

+ 

+ 

- 

- 

+ 

abeg 

91.3 

2 

+ 

+ 

+ 

+ - 

- 

bede 

136.7 

3 + 

- 

+ 

+ 

+ 

- 

aedf 

82.4 

4 

- 

+ 

- 

+ + 

+ 

cefg 

73.4 

5 + 

+ 

- 

- 

+ + 

- 

abef 

94.1 

6 

+ 

- 

+ 

+ 

+ 

bdfg 

143.8 

7 + 

- 

- 

+ 

+ 

+ 

adeg 

87.3 

8 

- 

- 

- 

- 

- 

(1) 

71.9 
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The two largest effects are B and D. Furthermore, the third largest effect is BD + 
CE + FG, so it seems reasonable to attribute this to the BD interaction. The analyst used 
the two factors distance (B) and illumination level (D) in subsequent experiments with 
the other factors A, C, E, and F at standard settings and verified the results obtained here. 
He decided to use subjects as blocks in these new experiments rather than ignore a 
potential subject effect because several different subjects had to be used to complete the 
experiment. 


The Defining Relation for a Fold-Over Design 

Combining fractional factorial designs via fold over as demonstrated in Example 8-7 is 
a very useful technique. It is often of interest to know the defining relation for the 
combined design. It can be easily determined. Each separate fraction will have L + U 
words used as generators: L words of like sign and U words of unlike sign. The combined 
design will have L + U — 1 words used as generators. These will be the L words of like 
sign and the U - 1 words consisting of independent even products of the words of unlike 
sign. (Even products are words taken two at a time, four at a time, and so forth.) 

To illustrate this procedure, consider the design in Example 8-7. For the first fraction, 
the generators are 

I = ABD, I = ACE, I = BCF, and / = ABCG 
and for the second fraction, they are 

I = -ABD, I = -ACE, / = -BCF, and 1 = ABCG 
Notice that in the second fraction we have switched the signs on the generators with an 
odd number of letters. Also, notice that L + t/=l + 3 = 4. The combined design will 
have I = ABCG (the like sign word) as a generator and two words that are independent 
even products of the words of unlike sign. For example, take / = ABD and / = ACE\ 
then I = ( ABD)(ACE ) = BCDE is a generator of the combined design. Also, take I = 
ABD and / = BCF; then I = ( ABD)(BCF ) = ACDF is a generator of the combined 
design. The complete defining relation for the combined design is 

/ = ABCG = BCDE = ACDF = ADEG = BDFG = ABEF = CEFG 
Plackett-Burman Designs 

These designs, attributed to Plackett and Burman (1946), are two-level fractional factorial 
designs for studying k = N — 1 variables in N runs, where A is a multiple of 4. If N is 
a power of 2, these designs are identical to those presented earlier in this section. How¬ 
ever, for N = 12, 20, 24, 28, and 36, the Plackett-Burman designs are sometimes of 
interest. Because these designs cannot be represented as cubes, they are sometimes called 
nongeometric designs. 

The upper half of Table 8-23 (on the next page) presents rows of plus and minus 
signs that are used to construct the Plackett-Burman designs for N = 12, 20, 24, and 
36, whereas the lower half of the table presents blocks of plus and minus signs for 
constructing the design for N = 28. The designs for N = 12, 20, 24, and 36 are obtained 
by writing the appropriate row in Table 8-23 as a column (or row). A second column (or 
row) is then generated from this first one by moving the elements of the column (or row) 
down (or to the right) one position and placing the last element in the first position. A 
third column (or row) is produced from the second similarly, and the process is continued 
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Table 8-23 Plus and Minus Signs for the Plackett-Burman Designs 


k = 

k = 

i fc = 

11, IV = 12 + + - + + +-+ - 

19, IV = 20 + +-+ + + + - + - 

23, N = 24 +++++-+-++- 

- +-+ + - 

-- + +-+ - +- 






k = 27, N = 28 


+-++++- 

- +-+-+ 

++-+-++-+ 


++-+++- 

-+ +-+- 

-++++-++- 


-+++++- 

+-+-+ - 

+ - + - + H— + + 


-+-++++ 

-+ - +-+ 

+-+++-+-+ 


-++-+++ 

+-+ +- 

++-++++- 


-+++++ 

- + - +-+ - 

-+++-+-++ 


+ + +-+ - + 

-+-+-+- 

+-++-+++- 


+ + +-+ + - 

+-+-+ 

+ + - + +-+• + 


+ + +-+ + 

-+-+-+- 

-++-+++-+ 


until column (or row) k is generated. A row of minus signs is then added, completing 
the design. For N = 28, the three blocks X, Y, and Z are written down in the order 
X Y Z 
Z X Y 
Y Z X 

and a row of minus signs is added to these 27 rows. The design for N = 12 runs and 
k = 11 factors is shown in Table 8-24. 

The nongeometric Plackett-Burman designs for N = 12, 20, 24, 28, and 36 have 
very messy alias structures. For example, in the 12-run design every main effect is par¬ 
tially aliased with every two-factor interaction not involving itself. For example, the AB 
interaction is aliased with the nine main effects C, D ,..., K. Furthermore, each main 
effect is partially aliased with 45 two-factor interactions. In the larger designs, the 
situation is even more complex. We advise the experimenter to use these designs very 
carefully. 


Table 8-24 Plackett-Burman Design for N = 12, k = 11 
Run A B C D E F 


12 
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(a) Projection into three factors 



(6) Projection into four factors 

Figure 8-24 Projection of the 12-run Plackett-Burman design into three- 
and four-factor designs. 


The projective properties of the nongeometric Plackett-Burman designs are not ter¬ 
ribly attractive. For example, consider the 12-run design in Table 8-24. This design will 
project into three replicates of a full 2 2 design in any two of the original 11 factors. 
However, in three factors, the projected design is a full 2 3 factorial plus a 2;V 1 fractional 
factorial (see Figure 8-24<a). Thus, the resolution IH Plackett-Burman design has pro- 
jectivity 3, meaning it will collapse into a full factorial in any subset of three factors. 
The 2n7 p design only has projectivity 2. The four-dimensional projections are shown in 
Figure 8-24 b. Notice that these three- and four-factor projections are not balanced 
designs. 

EXAMPLE 8-8 . 

We will illustrate some of the potential difficulties associated with the Plackett-Burman 
designs by using the 11-variable, 12-run design and a set of simulated data. We will 
assume that the process has three significant main effects ( A, B, D) and two significant 
two-factor interactions (AB and AD). The model is 

y = 200 + 8*] + 10x 2 + 12*4 — \2x x x 2 + 9x 1 x 4 + e 
where each x, is a coded variable defined on the — 1, +1 interval and e is an NID(0, 9) 
random error term. Therefore, three of the k = 11 factors are large, and there are two 
large interactions; not an unreasonable situation. 

Table 8-25 (page 346) presents the 12-run Plackett-Burman design and the simulated 
responses. This design looks different from the 12-run design in Table 8-24 because it 
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Table 8-25 Plackett-Burman Design for Example 8-8 


Run 

A 

B 

C 

D 

E 

F 

G 

H 

/ 

K 

L 

Response 

1 

+ 

+ 

- 

+ 

+ 

+ 

- 

- 

- 

+ 

- 

231 

2 

- 

+ 

+ 

- 

+ 

+ 

+ 

- 

- 

- 

+ 

207 

3 

+ 

- 

+ 

+ 

- 

4- 

+ 

+ 

- 

- 

- 

230 

4 

- 

+ 

- 

+ 

+ 

- 

+ 

+ 

+ 

- 

- 

217 

5 

- 

- 

+ 

- 

+ 

+ 

- 

+ 

+ 

+ 

- 

175 

6 

- 

- 

- 

+ 

- 

+ 

+ 

- 

+ 

+ 

+ 

176 

7 

+ 

- 

- 

- 

+ 

- 

+ 

+ 

- 

+ 

+ 

183 

8 

+ 

+ 

- 

- 

- 

+ 

- 

+ 

+ 

- 

+ 

185 

9 

+ 

+ 

+ 

- 

- 

- 

+ 

- 

+ 

+ 

- 

181 

10 

- 

+ 

+ 

+ 

- 

- 

- 

+ 

- 

+ 

+ 

220 

11 

+ 

- 

+ 

+ 

+ 

- 

- 

- 

+ 

- 

+ 

229 

12 

- 

- 

- 

- 

- 

- 

- 

- 

- 

- 

- 

168 


was constructed using the row of signs for k = 11, N = 12 in Table 8-23 as a row. The 
effect estimates are shown in Table 8-26. Notice that there are seven large effects; A, B, 
C, D, E, J, and K (and, of course, their aliases). It is not immediately obvious that some 
of these effects could be interactions. Some of this ambiguity could be resolved by folding 
over the design. This will generally resolve the main effects, but it often still leaves the 
experimenter uncertain about interaction effects. 


The difficulty in interpretation in a Plackett-Burman design, illustrated in the above 
example, occurs fairly often in practice. If your choice is between a geometric 2 I 1 n -7 
design with 16 runs or a 12-run Plackett-Burman design that may have to be folded over 
(thereby requiring 24 runs), the geometric design may turn out to be a better choice. For 
more details, see Montgomery, Borror, and Stanley (1997-98). Under some conditions, 


Table 8-26 Effect Estimates, Regression Coefficients, and Sums of Squares for Example 8-8 


Variable" 

Regression Coefficient 

Estimated Effect 

Sum of Squares 

Overall Average 

200.167 



A 

6.333 

12.667 

481.333 

B 

6.667 

13.333 

533.333 

C 

6.833 

12.667 

560.333 

D 

17.000 

34.000 

3468.000 

E 

6.833 

13.667 

560.333 

F 

0.500 

1.000 

3.000 

G 

-1.167 

-2.333 

16.333 

H 

1.500 

3.000 

27.000 

J 

-6.333 

-12.667 

481.333 

K 

-5.833 

-11.667 

408.333 

L 

-0.167 

-0.333 

0.333 


" Every main effect is partially aliased with 45 two-factor interactions. 
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we can untangle the aliases in a nongeometric Plackett-Burman design by using regres¬ 
sion model-building techniques. This is discussed by Hamada and Wu (1992). 


8-6 RESOLUTION IV AND V DESIGNS 

A 2 k ~ p fractional factorial design is of resolution IV if the main effects are clear of two- 
factor interactions and some two-factor interactions are aliased with each other. Thus, if 
three-factor and higher interactions are suppressed, the main effects may be estimated 
directly in a 2\y p design. An example is the 2?v 2 design in Table 8-10. Furthermore, the 
two combined fractions of the 2uf 4 design in Example 8-7 yields a 2jy 3 design. 

Any 2iy p design must contain at least 2k runs. Resolution IV designs that contain 
exactly 2k runs are called minimal designs. Resolution IV designs may be obtained from 
resolution III designs by the process of fold over. Recall that to fold over a 2V’ design, 
simply add to the original fraction a second fraction with all the signs reversed. Then the 
plus signs in the identity column / in the first fraction could be switched in the second 
fraction, and a (k + l)st factor could be associated with this column. The result is a 
2\y 1 ~ p fractional factorial design. The process is demonstrated in Table 8-27 for the 
2V design. It is easy to verify that the resulting design is a 2fv 1 design with defining 
relation / = ABCD. 

It is also possible to fold over resolution IV designs to separate two-factor interac¬ 
tions that are aliased with each other. Montgomery and Runger (1996) observe that an 
experimenter may have several objectives in folding over a resolution IV design, such 
as (1) break as many two-factor interaction alias chains as possible, (2) break the two- 
factor interactions on a specific alias chain, or (3) break the two-factor interactions in¬ 
volving a specific factor. One way to fold over a resolution IV design is to run a second 
fraction in which the sign is reversed on every design generator that has an even number 
of letters. To illustrate, consider the 2?v 2 design used for the injection molding experiment 
in Example 8-4. The generators for the design in Table 8-10 are / = ABCE and / = 
BCDF. The second fraction would use the generators I = -ABCE and I = -BCDF, and 


Table 8-27 A 2f v 1 Design Obtained 
by Fold Over 


D 




I 

A 

B 

C 


Original 2 V 

1 = ABC 


+ 

- 

- 

+ 

+ 

+ 

- 

- 

+ 

- 

+ 

- 

+ 

+ 

+ 

#■; 

Second 2^ 1 with Signs Switched 

- 

+ 

+ 

- 

- 

- 

+ 

+ 

- 

+ 

- 

+ 

- 

- 

- 
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the single generator for the combined design would be / = ADEF. Thus, the combined 
design is still a resolution IV fractional factorial design. However, the alias relationships 
will be much simpler than in the original 2fv 2 design. In fact, the only two-factor inter¬ 
actions that will be aliased are AD = EF, AE = DF, and AF = DE. All the other two- 
factor interactions can be estimated from the combined design. 

As another example, consider the 2? v 3 design with 32 runs. Table 8-14 indicates that 
the best set of generators for this design is I = ABCF, I =ABDG, and I = BCDEH. 
Appendix Table XII(m) shows the aliases for this design. Notice that there are six pairs 
of two-factor interactions and one group of three two-factor interactions that are aliases. 
If we fold over this design, the second fraction would have the generators / = -ABCF, 
I - -ABDG, and / = BCDEH. The combined design has the generators I = CDFG and 
I = BCDEH, and the complete defining relation is 

I = CDFG = BCDEH = BEFGH 

The combined design is of resolution IV, but the only two-factor interactions that are 
still aliased are CD = FG, CF = DG, and CG = DF. This is a considerable simplification 
of the aliasing in the original fraction. 

Notice that when we start with a resolution HI design, the fold-over procedure guar¬ 
antees that the combined design will be of resolution IV, thereby ensuring that all the 
main effects can be separated from their two-factor interaction aliases. When folding 
over a resolution IV design, we will not necessarily separate all the two-factor interac¬ 
tions. In fact, if the original fraction has an alias structure with more than two two-factor 
interactions in any alias chain, folding over will not completely separate all the two- 
factor interactions. Both of the foregoing examples, the 2?v 2 and the 2? v 3 , have at least 
one such two-factor interaction alias chain. Montgomery and Runger (1996) give a table 
of recommended fold-over designs for resolution IV fractions with 6 =£ k =£ 10 factors. 

Resolution V designs are fractional factorials in which the main effects and the two- 
factor interactions do not have other main effects and two-factor interactions as their 
aliases. These are very powerful designs, allowing the unique estimation of all the main 
effects and two-factor interactions, provided that all the three-factor and higher interac¬ 
tions are negligible. The smallest word in the defining relation of such a design must 
have five letters. The 2 5_ 1 design with the generating relation I = ABCDE is of resolution 
V. Another example is the 2%~ 2 design with the generating relations I = ABCDG and 
I = ABEFH. Further examples of these designs are given by Box and Hunter (1961b). 

Because the standard resolution V designs are large designs when the number of 
factors is moderately large, there is some practical interest in irregular resolution V 
fractional factorials. These are useful designs for 4 k =£ 9 factors. The 24-run design 
for k = 5 factors is shown in Table 8-28. Because this is a resolution V design, all five 
main effects and all 10 two-factor interactions can be estimated, assuming that three- 
factor and higher interactions are negligible. The design for k = 4 factors has 12 runs 
and is discussed in Problem 8-22. For k = 6, 7, and 8 these designs have 48 runs, and 
the nine-factor design has 96 runs. The Design-Expert software package contains all of 
these designs. 

Finally, we point out that a complete fold over of a resolution IV or V design is 
usually unnecessary. Generally, there are only one or two (or a very few) aliased inter¬ 
actions that are potentially of interest. These interactions can usually be de-aliased by 
adding a smaller number of runs to the original fraction. This technique is sometimes 
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Table 8-28 The Irregular Resolution V 
Fraction for Five Factors 
in 24 Runs 

A B C D E 



called a partial fold over. Refer to Example 10-5 and the supplemental text material for 
this chapter for some insight regarding how this is done. 

8-7 SUMMARY 

This chapter has introduced the 2 k ~ p fractional factorial design. We have emphasized the 
use of these designs in screening experiments to quickly and efficiently identify the subset 
of factors that are active and to provide some information on interaction. The projective 
property of these designs makes it possible in many cases to examine the active factors 
in more detail. Sequential assembly of these designs via fold over is a very effective way 
to gain additional information about interactions that an initial experiment may identify 
as possibly important. 

In practice, 2 k p fractional factorial designs with N = 4, 8,16, and 32 runs are highly 
useful. Table 8-29 (page 350) summarizes these designs, identifying how many factors 
can be used with each design to obtain various types of screening experiments. For 
example, the 16-run design is a full factorial for 4 factors, a one-half fraction for 5 factors, 
a resolution IV fraction for 6 to 8 factors, and a resolution III fraction for 9 to 15 factors. 
All of these designs may be constructed using the methods discussed in this chapter, and 
many of their alias structures are shown in Appendix Table XII. 
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Table 8-29 Useful Factorial and Fractional Factorial Designs 
from the 2 k ~ p System. The Numbers in the Cells 
are the Number of Factors in the Experiment 


Design Type 


Number of Runs 


4 

8 

16 

32 

Full factorial 

2 

3 

4 

5 

Half-fraction 

3 

4 

5 

6 

Resolution IV fraction 

— 

4 

6-8 

7-16 

Resolution III fraction 

3 

5-7 

9-15 

17-31 


8-8 PROBLEMS-- 

8-1. Suppose that in the chemical process development experiment described in Problem 6-7, 

it was only possible to run a one-half fraction of the 2 4 design. Construct the design and 
perform the statistical analysis, using the data from replicate I. 

8-2. Suppose that in Problem 6-15, only a one-half fraction of the 2 4 design could be run. 
Construct the design and perform the analysis, using the data from replicate I. 

8-3. Consider the plasma etch experiment described in Problem 6-18. Suppose that only a one- 
half fraction of the design could be run. Set up the design and analyze the data. 

8-4. Problem 6-21 describes a process improvement study in the manufacturing process of an 
integrated circuit. Suppose that only eight runs could be made in this process. Set up an 
appropriate 2 5 ~ 2 design and find the alias structure. Use the appropriate observations from 
Problem 6-21 as the observations in this design and estimate the factor effects. What 
conclusions can you draw? 

8-5. Continuation of Problem 8-4. Suppose you have made the eight runs in the 2 5 ~ 2 design 
in Problem 8-4. What additional runs would be required to identify the factor effects that 
are of interest? What are the alias relationships in the combined design? 

8-6. R. D. Snee (“Experimenting with a Large Number of Variables,” in Experiments in 
Industry: Design, Analysis and Interpretation of Results, by R. D. Snee, L. B. Hare, and 
J. B. Trout, Editors, ASQC, 1985) describes an experiment in which a 2 5_1 design with 
I = ABCDE was used to investigate the effects of five factors on the color of a chemical 
product. The factors are A = solvent/reactant, B = catalyst/reactant, C = temperature, 
D = reactant purity, and £ = reactant pH. The results obtained were as follows: 


e = -0.63 
a = 2.51 

b = -2.68 
abe = 1.66 

c = 2.06 

ace = 1.22 

bee = -2.09 
abc = 1.93 


d = 6.79 
ade - 5.47 
bde = 3.45 
abd = 5.68 
ede = 5.22 
acd = 4.38 
bed = 4.30 
abode = 4.05 


(a) Prepare a normal probability plot of the effects. Which effects seem active? 

(b) Calculate the residuals. Construct a normal probability plot of the residuals and plot 
the residuals versus the fitted values. Comment on the plots. 

(c) If any factors are negligible, collapse the 2 5_1 design into a full factorial in the active 
factors. Comment on the resulting design, and interpret the results. 

8-7. An article by J. J. Pignatiello, Jr. and J. S. Ramberg in the Journal of Quality Technology 
(Vol. 17, 1985, pp. 198-206) describes the use of a replicated fractional factorial to 
investigate the effect of five factors on the free height of leaf springs used in an automotive 
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application. The factors are A = furnace temperature, B = heating time, C = transfer 
time, D = hold down time, and E = quench oil temperature. The data are shown below: 


4 

B 

C 

D 

E 


Free Height 


- 

- 

- 

- 

- 

7.78 

7.78 

7.81 

+ 

- 

- 

+ 

- 

8.15 

8.18 

7.88 

- 

+ 

- 

+ 

- 

7.50 

7.56 

7.50 

+ 

+ 

- 

- 

- 

7.59 

7.56 

7.75 

- 

- 

+ 

+ 

- 

7.54 

8.00 

7.88 

+ 

- 

+ 

- 

- 

7.69 

8.09 

8.06 

- 

+ 

+ 

- 

- 

7.56 

7.52 

7.44 

+ 

+ 

+ 

+ 

- 

7.56 

7.81 

7.69 

- 

- 

- 

- 

+ 

7.50 

7.25 

7.12 

+ 

- 

- 

+ 

+ 

7.88 

7.88 

7.44 

- 

+ 

- 

+ 

+ 

7.50 

7.56 

7.50 

+ 

+ 

- 

- 

+ 

7.63 

7.75 

7.56 

- 

- 

+ 

+ 

+ 

7.32 

7.44 

7.44 

+ 

- 

+ 

- 

+ 

7.56 

7.69 

7.62 

- 

+ 

+ • 

- 

+ 

7.18 

7.18 

7.25 

+ 

+ 

+ 

+ 

+ 

7.81 

7.50 

7.59 


(a) Write out the alias structure for this design. What is the resolution of this design? 

(b) Analyze the data. What factors influence the mean free height? 

(c) Calculate the range and standard deviation of the free height for each run. Is there 
any indication that any of these factors affects variability in the free height? 

(d) Analyze the residuals from this experiment, and comment on your findings. 

(e) Is this the best possible design for five factors in 16 runs? Specifically, can you find 
a fractional design for five factors in 16 runs with a higher resolution than this one? 

8-8. An article in Industrial and Engineering Chemistry (“More on Planning Experiments to 
Increase Research Efficiency,” 1970, pp. 60-65) uses a 2 5 ~ 2 design to investigate the 
effect of A = condensation temperature, B = amount of material 1, C = solvent volume, 
D = condensation time, and E = amount of material 2 on yield. The results obtained are 
as follows: 

e = 23.2 ad = 16.9 cd = 23.8 bde = 16.8 

ab = 15.5 be = 16.2 ace = 23.4 abede = 18.1 

(a) Verify that the design generators used were I = ACE and I = BDE. 

(b) Write down the complete defining relation and the aliases for this design. 

(c) Estimate the main effects. 

(d) Prepare an analysis of variance table. Verify that the AB and AD interactions are 
available to use as error. 

(e) Plot the residuals versus the fitted values. Also construct a normal probability plot 
of the residuals. Comment on the results. 

8-9. Consider the leaf spring experiment in Problem 8-7. Suppose that factor E (quench oil 
temperature) is very difficult to control during manufacturing. Where would you set factors 
A, B, C, and D to reduce variability in the free height as much as possible regardless of 
the quench oil temperature used? 

8-10. Construct a 2 7-2 design by choosing two four-factor interactions as the independent gen¬ 
erators. Write down the complete alias structure for this design. Outline the analysis of 
variance table. What is the resolution of this design? 
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8-11. Consider the 2 5 design in Problem 6-21. Suppose that only a one-half fraction could be 
run. Furthermore, two days were required to take the 16 observations, and it was necessary 
to confound the 2 5-1 design in two blocks. Construct the design and analyze the data. 

8-12. Analyze the data in Problem 6-23 as if it came from a 2^7' design with I = ABCD. Project 
the design into a full factorial in the subset of the original four factors that appear to be 
significant. 

8-13. Repeat Problem 8-12 using 7 = -ABCD. Does use of the alternate fraction change your 
interpretation of the data? 

8-14. Project the 1 design in Example 8-1 into two replicates of a 2 2 design in the factors A 
and B. Analyze the data and draw conclusions. 

8-15. Construct a 2?„ 3 design. Determine the effects that may be estimated if a second fraction 
of this design is run with all signs reversed. 

8-16. Consider the 2f„ 3 design in Problem 8-15. Determine the effects that may be estimated 
if a second fraction of this design is run with the signs for factor A reversed. 

8-17. Fold over the 2m 4 design in Table 8-19 to produce an eight-factor design. Verify that the 
resulting design is a 2?7" 4 design. Is this a minimal design? 

8-18. Fold over a 2jj7 2 design to produce a six-factor design. Verify that the resulting design is 
a 2fv 2 design. Compare this design to the 2?y 2 design in Table 8-10. 

8-19. An industrial engineer is conducting an experiment using a Monte Carlo simulation model 
of an inventory system. The independent variables in her model are the order quantity 
(A), the reorder point (B), the setup cost (C), the backorder cost (£>), and the carrying cost 
rate (E). The response variable is average annual cost. To conserve computer time, she 
decides to investigate these factors using a 2fn 2 design with I = ABD and / = BCE. 
The results she obtains are de = 95, ae = 134, b = 158, abd = 190, cd = 92, ac = 187, 
bee = 155, and abede = 185. 

(a) Verify that the treatment combinations given are correct. Estimate the effects, as¬ 
suming three-factor and higher interactions are negligible. 

(b) Suppose that a second fraction is added to the first, for example, ade = 136, e = 
93, ab = 187, bd = 153, acd = 139, c = 99, abce = 191, and bede = 150. How 
was this second fraction obtained? Add this data to the original fraction, and estimate 
the effects. 

(c) Suppose that the fraction abc = 189, ce = 96, bed = 154, aede = 135, abe = 193, 
bde = 152, ad = 137, and (1) = 98 was run. How was this fraction obtained? Add 
this data to the original fraction and estimate the effects. 

8-20. Construct a 2 5 “ 1 design. Show how the design may be run in two blocks of eight obser¬ 
vations each. Are any main effects or two-factor interactions confounded with blocks? 

8-21. Construct a 2 7 2 design. Show how the design may be run in four blocks of eight obser¬ 
vations each. Are any main effects or two-factor interactions confounded with blocks? 

8-22. Irregular fractions of the 2 k [John (1971)]. Consider a 2 4 design. We must estimate the 
four main effects and the six two-factor interactions, but the full 2 4 factorial cannot be 
run. The largest possible block size contains 12 runs. These 12 runs can be obtained from 
the four one-quarter replicates defined by / = ±AB = ±ACD = ±BCD by omitting the 
principal fraction. Show how the remaining three 2 4-2 fractions can be combined to es¬ 
timate the required effects, assuming three-factor and higher interactions are negligible. 
T his design could be thought of as a three-quarter fraction. 

8-23. Carbon anodes used in a smelting process are baked in a ring furnace. An experiment is 
run in the furnace to determine which factors influence the weight of packing material 
that is stuck to the anodes after baking. Six variables are of interest, each at two levels: 
A = pitch/fines ratio (0.45, 0.55), B = packing material type (1, 2), C = packing material 
temperature (ambient, 325°C), D = flue location (inside, outside), E = pit temperature 
(ambient, 195°C), and F = delay time before packing (zero, 24 hours). A 2 6 ~ 3 design is 
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run, and three replicates are obtained at each of the design points. The weight of packing 
material stuck to the anodes is measured in grams. The data in run order are as follows: 
abd = (984, 826, 936); abcdef= (1275,976,1457); be = (1217, 1201, 890); af= (1474, 
1164, 1541); def= (1320, 1156, 913); cd = (765, 705, 821); ace = (1338, 1254, 1294); 
and bcf = (1325,1299,1253). We wish to minimize the amount of stuck packing material. 

(a) Verify that the eight runs correspond to a 2f„ 3 design. What is the alias structure? 

(b) Use the average weight as a response. What factors appear to be influential? 

(c) Use the range of the weights as a response. What factors appear to be influential? 

(d) What recommendations would you make to the process engineers? 

8-24. A 16-run experiment was performed in a semiconductor manufacturing plant to study the 
effects of six factors on the curvature or camber of the substrate devices produced. The 
six variables and their levels are shown below: 


Run 

Lamination Lamination 

Temperature Time 

(°C) (s) 

Lamination Firing Cycle 

Pressure Temperature Time 

(tn) (°C) (h) 

Dew § 

Point 

(°C) 

1 

55 


10 

5 

1580 

17.5 

20 

2 

75 


10 

5 

1580 

29 

26 

3 

55 


25 

5 

1580 

29 

20 

4 

75 


25 

5 

1580 

17.5 

26 

5 

55 


10 

10 

1580 

29 

26 

6 

75 


10 

10 

1580 

17.5 

20 

7 

55 


25 

10 

.1580 

17.5 

26 

8 

75 


25 

10 

1580 

29 

20 

9 

55 


10 

5 

1620 

17.5 

26 

10 

75 


10 

5 

1620 

29 

20 

11 

55 


25 

5 

1620 

29 

26 

12 

75 


25 

5 

1620 

17.5 

20 

13 

55 


10 

10 

1620 

29 

20 

14 

75 


10 

10 

1620 

17.5 

26 

15 

55 


25 

10 

1620 

17.5 

20 

16 

75 


25 

10 

1620 

29 

26 

Each run was replicated four times, and a camber measurement v 
The data are shown below: 

/as taken on the substrate. 


Camber for Replicate (in/in) 

Total 

Mean 

Standard 

Run 

1 

2 

3 

4 

(10 4 in/in) 

(10 4 in/in) 

Deviation 

1 

0.0167 

0.0128 

0.0149 

0.0185 

629 

157.25 

24.418 

2 

0.0062 

0.0066 

0.0044 

0.0020 

192 

48.00 

20.976 

3 

0.0041 

0.0043 

0.0042 

0.0050 

176 

44.00 

4.083 

4 

0.0073 

0.0081 

0.0039 

0.0030 

223 

55.75 

25.025 

5 

0.0047 

0.0047 

0.0040 

0.0089 

223 

55.75 

22.410 

6 

0.0219 

0.0258 

0.0147 

0.0296 

920 

230.00 

63.639 

7 

0.0121 

0.0090 

0.0092 

0.0086 

389 

97.25 

16.029 

8 

0.0255 

0.0250 

0.0226 

0.0169 

900 

225.00 

39.42 

9 

0.0032 

0.0023 

0.0077 

0.0069 

201 

50.25 

26.725 

10 

0.0078 

0.0158 

0.0060 

0.0045 

341 

85.25 

50.341 
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Run 

Camber for Replicate (in/in) 

12 3 4 

Total 

(10 4 in/in) 

Mean 

(10 4 in/in) 

Standard 

Deviation 

11 

0.0043 

0.0027 

0.0028 

0.0028 

126 

31.50 

7.681 

12 

0.0186 

0.0137 

0.0158 

0.0159 

640 

160.00 

20.083 

13 

0.0110 

0.0086 

0.0101 

0.0158 

455 

113.75 

31.12 

14 

0.0065 

0.0109 

0.0126 

0.0071 

371 

92.75 

29.51 

15 

0.0155 

0.0158 

0.0145 

0.0145 

603 

150.75 

6.75 

16 

0.0093 

0.0124 

0.0110 

0.0133 

460 

115.00 

17.45 


(a) What type of design did the experimenters use? 

(b) What are the alias relationships in this design? 

(c) Do any of the process variables affect average camber? 

Table 8-30 Data for Problem 8-25 


Resist Thickness 


Run Volume Batch Time, s Speed Acc. Cover Left Center Right Avg. Range 


1 5 Batch 2 

2 5 Batch 1 

3 3 Batch 1 

4 3 Batch 2 

5 3 Batch 1 

6 5 Batch 1 

7 3 Batch 1 

8 5 Batch 2 

9 5 Batch 1 

10 3 Batch 1 

11 3 Batch 2 

12 3 Batch 1 

13 5 Batch 1 

14 3 Batch 1 

15 5 Batch 2 

16 5 Batch 2 

17 3 Batch 2 

18 3 Batch 1 

19 5 Batch 2 

20 3 Batch 2 

21 5 Batch 1 

22 3 Batch 2 

23 5 Batch 1 

24 3 Batch 2 

25 5 Batch 1 

26 3 Batch 2 

27 3 Batch 1 

28 5 Batch 2 

29 5 Batch 1 

30 5 Batch 2 

31 5 Batch 2 

32 3 Batch 2 


14 7350 5 

6 7350 5 

6 6650 5 

14 7350 20 

14 7350 5 

6 6650 20 

6 7350 5 

14 6650 20 

14 6650 5 

14 6650 5 

14 6650 20 

6 7350 20 

6 6650 5 

6 6650 20 

14 7350 20 

6 7350 5 

14 7350 5 

14 6650 20 

6 7350 20 

6 7350 5 

14 6650 20 

6 6650 5 

14 7350 20 

6 7350 20 

14 7350 5 

6 6650 20 

14 7350 20 

6 6650 5 

6 7350 20 

6 6650 20 

14 6650 5 

14 6650 5 


Off 4531 4531 

Off 4446 4464 

Off 4452 4490 

Off 4316 4328 

Off 4307 4295 

Off 4470 4492 

On 4496 4502 

Off 4542 4547 

Off 4621 4643 

On 4653 4670 

On 4480 4486 

Off 4221 4233 

On 4620 4641 

On 4455 4480 

On 4255 4288 

On 4490 4534 

On 4514 4551 

Off 4494 4503 

Off 4293 4306 

Off 4534 4545 

On 4460 4457 

On 4650 4688 

Off 4231 4244 

On 4225 4228 

On 4381 4391 

Off 4533 4521 

On 4194 4230 

Off 4666 4695 

On 4180 4213 

On 4465 4496 

On 4653 4685 

Off 4683 4712 


4515 4525.7 16 

4428 4446 36 

4452 4464.7 38 

4308 4317.3 20 

4289 4297 18 

4495 4485.7 25 

4482 4493.3 20 

4538 4542.3 9 

4613 4625.7 30 

4645 4656 25 

4470 4478.7 16 

4217 4223.7 16 

4619 4626.7 22 

4466 4467 25 

4243 4262 45 

4523 4515.7 44 

4540 4535 37 

4496 4497.7 9 

4302 4300.3 13 

4512 4530.3 33 

4436 4451 24 

4656 4664.7 38 

4230 4235 14 

4208 4220.3 20 

4376 4382.7 15 

4511 4521.7 22 

4172 4198.7 58 

4672 4677.7 29 

4197 4196.7 33 

4463 4474.7 33 

4665 4667.7 32 

4677 4690.7 35 
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(d) Do any of the process variables affect the variability in camber measurements? 

(e) If it is important to reduce camber as much as possible, what recommendations would 
you make? 

8-25. A spin coater is used to apply photoresist to a bare silicon wafer. This operation usually 
occurs early in the semiconductor manufacturing process, and the average coating thick¬ 
ness and the variability in the coating thickness has an important impact on downstream 
manufacturing steps. Six variables are used in the experiment. The variables and their 
high and low levels are as follows: 


Factor 

Low Level 

High Level 

Final spin speed 

7350 rpm 

6650 rpm 

Acceleration rate 

5 

20 

Volume of resist applied 

3 cc 

5 cc 

Time of spin 

14 s 

6s 

Resist batch variation 

Batch 1 

Batch 2 

Exhaust pressure 

Cover off 

Cover on 


The experimenter decides to use a 2 6-1 design and to make three readings on resist thick¬ 
ness on each test wafer. The data are shown in Table 8-30. 

(a) Verify that this is a 2 6_1 design. Discuss the alias relationships in this design. 

(b) What factors appear to affect average resist thickness? 

(c) Because the volume of resist applied has little effect on average thickness, does this 
have any important practical implications for the process engineers? 

(d) Project this design into a smaller design involving only the significant factors. Graph¬ 
ically display the results. Does this aid in interpretation? 

(e) Use the range of resist thickness as a response variable. Is there any indication that 
any of these factors affect the variability in resist thickness? 

(f) Where would you recommend that the process engineers run the process? 

8-26. Harry and Judy Peterson-Nedry (two friends of the author) own a vineyard and winery in 
Newberg, Oregon. They grow several varieties of grapes and manufacture wine. Harry 
and Judy have used factorial designs for process and product development in the wine¬ 
making segment of their business. This problem describes the experiment conducted for 
their 1985 Pinot Noir. Eight variables, shown below, were originally studied in this 
experiment: 


Variable 

Low Level (— ) 

High Level (+) 

A = Pinot Noir clone 

Pommard 

Wadenswil 

B = Oak type 

Allier 

Troncais 

C = Age of barrel 

Old 

New 

D = Yeast/skin contact 

Champagne 

Montrachet 

E = Stems 

None 

All 

F = Barrel toast 

Light 

Medium 

G = Whole cluster 

None 

10% 

H = Fermentation temperature 

Low (75°F max) 

High (92°F max) 


Harry and Judy decided to use a 2fy 4 design with 16 runs. The wine was taste-tested by 
a panel of experts on March 8, 1986. Each expert ranked the 16 samples of wine tasted, 
with rank 1 being the best. The design and the taste-test panel results are shown in Table 
8-31 on page 356. 

(a) What are the alias relationships in the design selected by Harry and Judy? 

(b) Use the average ranks (y) as a response variable. Analyze the data and draw con- 
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elusions. You will find it helpful to examine a normal probability plot of the effect 
estimates. 

(c) Use the standard deviation of the ranks (or some appropriate transformation such as 
log s) as a response variable. What conclusions can you draw about the effects of 
the eight variables on variability in wine quality? 

(d) After looking at the results, Harry and Judy decide that one of the panel members 
(DCM) knows more about beer than he does about wine, so they decide to delete 
his ranking. What affect would this have on the results and conclusions from parts 
(b) and (c)? 

(e) Suppose that just before the start of the experiment, Harry and Judy discovered that 
the eight new barrels they ordered from France for use in the experiment would not 
arrive in time, and all 16 runs would have to be made with old barrels. If Harry and 
Judy just drop column C from their design, what does this do to the alias relation¬ 
ships? Do they need to start over and construct a new design? 

(f) Harry and Judy know from experience that some treatment combinations are unlikely 
to produce good results. For example, the run with all eight variables at the high 
level generally results in a poorly rated wine. This was confirmed in the March 8, 
1986 taste test. They want to set up a new design for their 1986 Pinot Noir using 
these same eight variables, but they do not want to make the run with all eight factors 
at the high level. What design would you suggest? 

8-27. In an article in Quality Engineering (“An Application of Fractional Factorial Experimental 

Designs,” 1988, Vol. 1, pp. 19-23) M. B. Kilgo describes an experiment to determine 
the effect of C0 2 pressure (A), C0 2 temperature (B), peanut moisture (C), C0 2 flow rate 
(D), and peanut particle size ( E ) on the total yield of oil per batch of peanuts (y). The 
levels that she used for these factors are as follows: 


Coded 

Level 

A, 

Pressure 

(bar) 

B, 

Temp. 

(°C) 

C, Moisture 
(% by weight) 

D, Flow 
(liters/min) 

E, 

Part. Size 
(mm) 

-1 

415 

25 

5 

40 

1.28 

1 

550 

95 

15 

60 

4.05 


She conducted the 16-run fractional factorial experiment shown below. 


415 25 5 

550 25 5 

415 95 5 

550 95 5 

415 25 15 

550 25 15 

415 95 15 

550 95 15 

415 25 5 

550 25 5 

415 95 5 

550 95 5 

415 25 15 

550 25 15 

415 95 15 

550 95 15 


40 1.28 63 

40 4.05 21 

40 4.05 36 

40 1.28 99 

40 4.05 24 

40 1.28 66 

40 1.28 71 

40 4.05 54 

60 4.05 23 

60 1.28 74 

60 1.28 80 

60 4.05 33 

60 1.28 63 

60 4.05 21 

60 4.05 44 

60 1.28 96 
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(a) What type of design has been used? Identify the defining relation and the alias 
relationships. 

(b) Estimate the factor effects and use a normal probability plot to tentatively identify 
the important factors. 

(c) Perform an appropriate statistical analysis to test the hypotheses that the factors 
identified in part (b) above have a significant effect on the yield of peanut oil. 

(d) Fit a model that could be used to predict peanut oil yield in terms of the factors that 
you have identified as important. 

(e) Analyze the residuals from this experiment and comment on model adequacy. 

8-28. A 16-run fractional factorial experiment in 10 factors on sand-casting of engine manifolds 

was conducted by engineers at the Essex Aluminum Plant of the Ford Motor Company 
and described in the article “Evaporative Cast Process 3.0 Liter Intake Manifold Poor 
Sandfill Study,” by D. Becknell {Fourth Symposium on Taguchi Methods , American 
Supplier Institute, Dearborn, MI, 1986, pp. 120-130). The purpose was to determine which 
of 10 factors has an effect on the proportion of defective castings. The design and the 
resulting proportion of nondefective castings p observed on each run are shown below. 
This is a resolution III fraction with generators E = CD, F = BD, G = BC, H = AC, 
J = AB, and K = ABC. Assume that the number of castings made at each run in the 
design is 1000. 


Run 

A 

B 

C 

D 

E 

F 

G 

H 


K 

P 

arcsin Vp 

F&T’s 

Modification 

1 

_ 

_ 

_ 

_ 

+ 

+ 

+ 

+ 

+ 

- 

0.958 

1.364 

1.363 

2 

+ 

_ 

_ 

- 

+ 

+ 

+ 

- 

- 

+ 

1.000 

1.571 

1.555 

3 

_ 

+ 

_ 

_ 

+ 

- 

- 

+ 

- 

+ 

0.977 

1.419 

1.417 

4 

+ 

+ 

_ 

- 

+ 

- 

- 

- 

+ 

- 

0.775 

1.077 

1.076 

5 

_ 

_ 

+ 

- 

_ 

+ 

- 

- 

+ 

+ 

0.958 

1.364 

1.363 

6 

+ 

_ 

+ 

- 

- 

+ 

- 

+ 

- 

- 

0.958 

1.364 

1.363 

7 

_ 

+ 

+ 

_ 

_ 

- 

+ 

- 

- 

- 

0.813 

1.124 

1.123 

8 

+ 

+ 

+ 

- 

- 

- 

+ 

+ 

+ 

+ 

0.906 

1.259 

1.259 

9 

_ 

_ 

_ 

+ 

- 

- 

+ 

+ 

+ 

- 

0.679 

0.969 

0.968 

10 

+ 

_ 

_ 

+ 

- 

- 

+ 

- 

- 

+ 

0.781 

1.081 

1.083 

11 

_ 

+ 

_ 

+ 

- 

+ 

- 

+ 

- 

+ 

1.000 

1.571 

1.556 

12 

+ 

+ 

_ 

+ 

- 

+ 

- 

- 

+ 

- 

0.896 

1.241 

1.242 

13 

_ 

_ 

+ 

+ 

+ 

- 

- 

- 

+ 

+ 

0.958 

1.364 

1.363 

14 

+ 

_ 

+ 

+ 

+ 

- 

- 

+ 

- 

- 

0.818 

1.130 

1.130 

15 

_ 

+ 

+ 

+ 

+ 

+ 

+ 

- 

- 

- 

0.841 

1.161 

1.160 

16 

+ 

+ 

+ 

+ 

+ 

+ 

+ 

+ 

+ 

+ 

0.955 

1.357 

1.356 


(a) Find the defining relation and the alias relationships in this design. 

(b) Estimate the factor effects and use a normal probability plot to tentatively identify 
the important factors. 

(c) Fit an appropriate model using the factors identified in part (b) above. 

(d) Plot the residuals from this model versus the predicted proportion of nondefective 
castings. Also prepare a normal probability plot of the residuals. Comment on the 
adequacy of these plots. 

(e) In part (d) you should have noticed an indication that the variance of the response 
is not constant (considering that the response is a proportion, you should have ex¬ 
pected this). The previous table also shows a transformation on p, the arcsin square 
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root, that is a widely used variance stabilizing transformation for proportion data 
(refer to the discussion of variance stabilizing transformations in Chapter 3). Repeat 
parts (a) through (d) above using the transformed response and comment on your 
results. Specifically, are the residual plots improved? 

(f) There is a modification to the arcsin square root transformation, proposed by Free¬ 
man and Tukey (“Transformations Related to the Angular and the Square Root,” 
Annals of Mathematical Statistics, Vol. 21, 1950, pp. 607-611) that improves its 
performance in the tails. F&T’s modification is: 

[arcsin Vnp/(n + 1) + arcsinV(np + 1)/(« + l)]/2 

Rework parts (a) through (d) using this transformation and comment on the results. 
(For an interesting discussion and analysis of this experiment, refer to “Analysis of 
Factorial Experiments with Defects or Defectives as the Response,” by S. Bisgaard 
and H. T. Fuller, Quality Engineering, Vol. 7, 1994-95, pp. 429-443.) 

8-29. A 16-run fractional factorial experiment in nine factors was conducted by Chrysler Motors 
Engineering and described in the article “Sheet Molded Compound Process Improve¬ 
ment,” by P. I. Hsieh and D. E. Goodwin ( Fourth Symposium on Taguchi Methods, 
American Supplier Institute, Dearborn, MI, 1986, pp. 13-21). The purpose was to reduce 
the number of defects in the finish of sheet-molded grill opening panels. The design, and 
the resulting number of defects, c, observed on each run, is shown below. This is 
a resolution III fraction with generators E = BD, F = BCD, G = AC, H = ACD, and 
J = AB. 


Run 

A 

B 

C 

D 

E 

F 

G 

H 

J 

c 

V? 

F&T’s 

Modification 

1 

- 

- 

- 

- 

+ 

- 

+ 

- 

+ 

56 

7.48 

7.52 

2 

+ 

- 

- 

- 

+ 

- 

- 

+ 

- 

17 

4.12 

4.18 

3 

- 

+ 

- 

- 

- 

+ 

+ 

- 

- 

2 

1.41 

1.57 

4 

+ 

-1- 

- 

- 

- 

+ 

- 

+ 

+ 

4 

2.00 

2.12 

5 

- 

- 

+ 

- 

+ 

+ 

- 

+ 

+ 

3 

1.73 

1.87 

6 

+ 

- 

+ 

- 

+ 

+ 

+ 

- 

- 

4 

2.00 

2.12 

7 

- 

+ 

+ 

- 

- 

- 

- 

+ 

- 

50 

7.07 

7.12 

8 

+ 

+ 

+ 


- 

- 

+ 

- 

+ 

2 

1.41 

1.57 

9 

- 

- 

- 

- 

- 

+ 

+ 

+ 

+ 

1 

1.00 

1.21 

10 

+ 

- 

- 

+ 

- 

+ 

- 

- 

- 

0 

0.00 

0.50 

11 

- 

+ 

- 

+ 

+ 

- 

+ 

+ 

- 

3 

1.73 

1.87 

12 

+ 

+ 

- 

+ 

+ 

- 

- 

- 

+ 

12 

3.46 

3.54 

13 

- 

- 

+ 

+ 

- 

- 

- 

- 

+ 

3 

1.73 

1.87 

14 

+ 

- 

+ 

+ 

- 

- 

+ 

+ 

- 

4 

2.00 

2.12 

15 

- 

+ 

+ 

+ 

+ 

+ 

- 

- 

- 

0 

0.00 

0.50 

16 

+ 

+ 

+ 

+ 

+ 

+ 

+ 

+ 

+ 

0 

0.00 

0.50 


(a) Find the defining relation and the alias relationships in this design. 

(b) Estimate the factor effects and use a normal probability plot to tentatively identify 
the important factors. 

(c) Fit an appropriate model using the factors identified in part (b) above. 

(d) Plot the residuals from this model versus the predicted number of defects. Also, 
prepare a normal probability plot of the residuals. Comment on the adequacy of these 
plots. 
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(e) In part (d) you should have noticed an indication that the variance of the response 
is not constant (considering that the response is a count, you should have expected 
this). The previous table also shows a transformation on c, the square root, that is a 
widely used variance stabilizing transformation for count data (refer to the discus¬ 
sion of variance stabilizing transformations in Chapter 3). Repeat parts (a) through 
(d) using the transformed response and comment on your results. Specifically, are 
the residual plots improved? 

(f) There is a modification to the square root transformation, proposed by Freeman and 
Tukey (“Transformations Related to the Angular and the Square Root,” Annals of 
Mathematical Statistics, Vol. 21,1950, pp. 607-611) that improves its performance. 
F&T’s modification to the square root transformation is 


[Vc + 


Rework parts (a) through (d) using this transformation and comment on the results. 
(For an interesting discussion and analysis of this experiment, refer to “Analysis of 
Factorial Experiments with Defects or Defectives as the Response,” by S. Bisgaard 
and H. T. Fuller, Quality Engineering, Vol. 7, 1994-1995, pp. 429-443.) 

8-30. An experiment is run in a semiconductor factory to investigate the effect of six factors on 
transistor gain. The design selected is the 2^ 2 shown below: 


Standard Run 

Order Order A B C D E F Gain 


2 

3 


6 

7 

8 
9 

10 

11 

12 

13 

14 

15 

16 



1455 

1511 

1487 

1596 

1430 

1481 

1458 

1549 

1454 

1517 

1487 

1596 

1446 

1473 

1461 

1563 


(a) Use a normal plot of the effects to identify the significant factors. 

(b) Conduct appropriate statistical tests for the model identified in part (a). 

(c) Analyze the residuals and comment on your findings. 

(d) Can you find a set of operating conditions that produce gain of 1500 ± 25? 

8-31. Heat treating is often used to carbonize metal parts, such as gears. The thickness of the 

carbonized layer is a critical output variable from this process, and it is usually measured 
by performing a carbon analysis on the gear pitch (the top of the gear tooth). Six factors 
were studied in a 2% 2 design: A = furnace temperature, B = cycle time, C = carbon 
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concentration, D = duration of the carbonizing cycle, E = carbon concentration of the 
diffuse cycle, and F = duration of the diffuse cycle. The experiment is shown below: 


Standard Run 

Order Order A B C D E F Pitch 

1 5 - -- -- - 74 


2 7 + 

3 8 - + 

4 2 + + 


5 10 

6 12 + 

7 16 - + 

8 1 + + 

9 6 - - 

10 9 + - 

11 14 - + 

12 13 + + 

13 11 

14 3 + - 

15 15 - + 

16 4 + + 


- - + - 190 

- - + + 133 

- - - + 127 

+ - + + 115 

+ - - + 101 

+ - - - 54 

+ - + 144 

- + - + 121 

- + + + 188 

- + + - 135 

- + - - 170 

+ + + - 126 

+ + - - 175 

+ + - + 126 

+ + + + 193 


(a) Estimate the factor effects and plot them on a normal probability plot. Select a 
tentative model. 

(b) Perform appropriate statistical tests on the model. 

(c) Analyze the residuals and comment on model adequacy. 

(d) Interpret the results of this experiment. Assume that a layer thickness of between 
140 and 160 is desirable. 

8-32. Five factors are studied in the irregular fractional factorial design of resolution V shown 
below: 


Standard Run 

Order Order A B 


2 10 - + 

3 5 + + 

4 4 - - 

5 15 + - 

6 19 - + 

7 16 

8 7 + - 

9 8 + + 

10 3 + - 

11 13 - + 

12 11 + + 

13 12 

14 20 + - 

15 9 + + 


CPE y 

- - - 16.33 

- - - 18.43 

- - - 27.07 

+ - - 16.95 

+ - - 14.58 

+ - - 19.12 

- + - 18.96 

- + - 23.56 

- + - 29.15 

+ + - 15.74 

+ + - 20.73 

+ + - 21.52 

- - + 15.58 

- - + 21.03 

- - + 26.78 
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Standard Run 

Order Order A B 

16 22 + - 

17 21 - + 

18 6 + + 

19 23 

20 18 - + 

21 24 + + 

22 17 

23 2 + - 

24 14 - + 


D 


y 

13.39 

18.63 

19.01 

17.96 

20.49 

29.31 

17.62 

16.03 

21.42 


(a) Analyze the data from this experiment. What factors influence the response y? 

(b) Analyze the residuals. Comment on model adequacy. 


I Three-Level and 
Mixed-Level Factorial 

and Fractional _ 

Factorial Designs 


The two-level series of factorial and fractional factorial designs discussed in Chapters 6, 
7, and 8 are widely used in industrial research and development. There are some exten¬ 
sions and variations of these designs that are occasionally useful, such as the designs for 
cases where all the factors are present at three levels. These 3* designs will be discussed 
in this chapter. We will also consider cases where some factors have two levels and other 
factors have either three or four levels. 


9-1 THE 3 k FACTORIAL DESIGN 

9-1.1 Notation and Motivation for the 3 k Design 

We now discuss the 3* factorial design; that is, a factorial arrangement with k factors 
each at three levels. Factors and interactions will be denoted by capital letters. We will 
refer to the three levels of the factors as low, intermediate, and high. There are several 
different notations used to represent these factor levels; one possibility is to represent 
the factor levels by the digits 0 (low), 1 (intermediate), and 2 (high). Each treatment 
combination in the 3 k design will be denoted by k digits, where the first digit indicates 
the level of factor A, the second digit indicates the level of factor B,. . ., and the Mi 
digit indicates the level of factor K. For example, in a 3 2 design, 00 denotes the treatment 
combination corresponding to A and B both at the low level, and 01 denotes the treatment 
combination corresponding to A at the low level and B at the intermediate level. Figures 
9-1 and 9-2 (on the next page) show the geometry of the 3 2 and the 3 3 design, respectively, 
using this notation. 

This system of notation could have been used for the 2 k designs presented previously, 
with 0 and 1 used in place of the minus and plus Is, respectively. In the 2 k design, we 
prefer the ± 1 notation because it facilitates the geometric view of the design and because 
it is directly applicable to regression modeling, blocking, and the construction of frac¬ 
tional factorials. 

In the 3 k system of designs, when the factors are quantitative, we often denote the 
low, intermediate, and high levels by —1, 0, and +1, respectively. This facilitates fitting 
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10 20 


Factor A 

Figure 9-1 Treatment combinations in a 3 2 design. 


a regression model relating the response to the factor levels. For example, consider the 
3 2 design in Figure 9-1, and let jc, represent factor A and x 2 represent factor B. A re¬ 
gression model relating the response y to x x and x 2 that is supported by this design is 

y = j8 0 + p x x x + 02*2 + P\ 2 XiX 2 + j8 n x? + j S 22 xl + e (9-1) 

Notice that the addition of a third factor level allows the relationship between the response 
and the design factors to be modeled as a quadratic. 

The 3* design is certainly a possible choice by an experimenter who is concerned 
about curvature in the response function. However, two points need to be considered: 

1. The 3* design is not the most efficient way to model a quadratic relationship; 
the response surface designs discussed in Chapter 11 are superior alternatives. 



Figure 9-2 Treatment combinations in a 3 3 design. 
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2. The 2 k design augmented with center points, as discussed in Chapter 6, is an 
excellent way to obtain an indication of curvature. It allows one to keep the size 
and complexity of the design low and simultaneously obtain some protection 
against curvature. Then, if curvature is important, the two-level design can be 
augmented with axial runs to obtain a central composite design, as shown in 
Figure 6-36. This sequential strategy of experimentation is far more efficient 
than running a 3 k factorial design with quantitative factors. 


9-1.2 The 3 2 Design 

The simplest design in the 3 k system is the 3 2 design, which has two factors, each at 
three levels. The treatment combinations for this design were shown in Figure 9-1. Be¬ 
cause there are 3 2 = 9 treatment combinations, there are eight degrees of freedom be¬ 
tween these treatment combinations. The main effects of A and B each have two degrees 
of freedom, and the AB interaction has four degrees of freedom. If there are n replicates, 
there will be n3 2 - 1 total degrees of freedom and 3 2 (n — 1) degrees of freedom for 
error. 

The sums of squares for A, B, and AB may be computed by the usual methods for 
factorial designs discussed in Chapter 5. Each main effect can be represented by a linear 
and a quadratic component, each with a single degree of freedom, as demonstrated in 
Equation 9-1. Of course, this is only meaningful if the factor is quantitative. 

The two-factor interaction AB may be partitioned in two ways. The first method 
consists of subdividing AB into the four single-degree-of-freedom components corre¬ 
sponding to AB LX l , AB lxq , ABqxl, and AB QXQ . This can be done by fitting the terms 
P 12 X 1 X 2 , P 122 X 1 X 2 , /3i 12 * 1 * 2 , and /3i 122 * 1 * 2 , respectively, as demonstrated in Example 5-5. 
For the tool life data, this yields SS ABlxl = 8.00, SS ABlxq = 42.67, SS ABqxl = 2.67, and 
SS ABgxe = 8.00. Because this is an orthogonal partitioning of AB, note that SS AB = 
SS ABlxl + SS ABlxq + SS ABqxl + SS ABqxq = 61.34. 

The second method is based on orthogonal Latin squares. Consider the totals of 
the treatment combinations for the data in Example 5-5. These totals are shown in Figure 
9-3 as the circled numbers in the squares. The two factors A and B correspond to the 
rows and columns, respectively, of a 3 X 3 Latin square. In Figure 9-3, two particular 
3X3 Latin squares are shown superimposed on the cell totals. 

These two Latin squares are orthogonal; that is, if one square is superimposed on 
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(a) ( b ) 

Figure 9-3 Treatment combination totals from Example 5-5 with two orthogonal 
Latin squares superimposed. 
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the other, each letter in the first square will appear exactly once with each letter in the 
second square. The totals for the letters in the (a) square are Q = 18, R = -2, and 
S = 8, and the sum of squares between these totals is [18 2 + (-2) 2 + 8 2 ]/(3)(2) - 
[24 2 /(9)(2)] = 33.34, with two degrees of freedom. Similarly, the letter totals in the (b) 
square are Q = 0, R = 6, and S = 18, and the sum of squares between these totals is 
[0 2 + 6 2 + 18 2 ]/(3)(2) - [24 2 /(9)(2)] = 28.00, with two degrees of freedom. Note that 
the sum of these two components is 

33.34 + 28.00 = 61.34 = SS AB 
with 2 + 2 = 4 degrees of freedom. 

In general, the sum of squares computed from square ( a ) is called the AB component 
of interaction, and the sum of squares computed from square (b) is called AB 2 com¬ 
ponent of interaction. The components AB and AB 2 each have two degrees of freedom. 
This terminology is used because if we denote the levels (0, 1, 2) for A and B by x, and 
x 2 , respectively, then we find that the letters occupy cells according to the following 
pattern: 


Square (a) 

Q : x t + x 2 = 0 (mod 3) 
R : xi + x 2 = 1 (mod 3) 
S: x x + x 2 = 2 (mod 3) 


Square( b ) 

Q: X\ + 2x 2 = 0 (mod 3) 
S: Xi + 2x 2 = 1 (mod 3) 
R: Xy + 2x 2 = 2 (mod 3) 


For example, in square ( b ), note that the middle cell corresponds to x x = 1 and x 2 = 1; 
thus, Xy + 2x 2 = 1 + (2)(1) = 3 = 0 (mod 3), and Q would occupy the middle cell. 
When considering expressions of the form A p B q , we establish the convention that the 
only exponent allowed on the first letter is 1. If the first letter exponent is not 1, the entire 
expression is squared and the exponents are reduced modulus 3. For example, A 2 B is the 
same as AB 2 because 


A 2 B = (A 2 B) 2 = A 4 B 2 = AB 2 


The AB and AB 2 components of the AB interaction have no actual meaning and are 
usually not displayed in the analysis of variance table. However, this rather arbitrary 
partitioning of the AB interaction into two orthogonal two-degree-of-freedom compo¬ 
nents is very useful in constructing more complex designs. Also, there is no connection 
between the AB and AB 2 components of interaction and the sums of squares for 
AB l><l , AB LX q, AB qxl , and AB QXQ . 

The AB and AB 2 components of interaction may be computed another way. Consider 
the treatment combination totals in either square in Figure 9-3. If we add the data by 
diagonals downward from left to right, we obtain the totals —3 + 4— 1 =0,-3 + 
10 - 1 = 6, and 5 + 11 + 2= 18. The sum of squares between these totals is 
28.00 (AB 2 ). Similarly, the diagonal totals downward from right to left are 5 + 4 - 
1 = 8, -3 + 2 - 1 = -2, and -3 + 11 + 10 = 18. The sum of squares between these 
totals is 33.34 (AB). Yates called these components of interaction the I and J components 
of interaction, respectively. We use both notations interchangeably; that is. 


I(AB) = AB 2 
./(AB) = AB 
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9-1.3 The 3 3 Design 

Now suppose there are three factors (A, B, and C) under study, and each factor is at three 
levels arranged in a factorial experiment. This is a 3 3 factorial design, and the experi¬ 
mental layout and treatment combination notation were shown previously in Figure 9-2. 
The 27 treatment combinations have 26 degrees of freedom. Each main effect has 2 
degrees of freedom, each two-factor interaction has 4 degrees of freedom, and the three- 
factor interaction has 8 degrees of freedom. If there are n replicates, there are «3 3 - 1 
total degrees of freedom and 3 3 (n — 1) degrees of freedom for error. 

The sums of squares may be calculated using the standard methods for factorial 
designs. In addition, if the factors are quantitative the main effects may be partitioned 
into linear and quadratic components, each with a single degree of freedom. The two- 
factor interactions may be decomposed into linear X linear, linear X quadratic, quadratic 
X linear, and quadratic X quadratic effects. Finally, the three-factor interaction ABC can 
be partitioned into eight single-degree-of-freedom components corresponding to linear 
X linear X linear, linear X linear X quadratic, and so on. Such a breakdown for the 
three-factor interaction is generally not very useful. 

It is also possible to partition the two-factor interactions into their I and J compo¬ 
nents. These would be designated AB, AB 2 , AC, AC 2 , BC, and BC 2 , and each component 
would have two degrees of freedom. As in the 3 2 design, these components have no 
physical significance. 

The three-factor interaction ABC may be partitioned into four orthogonal two- 
degrees-of-freedom components, which are usually called the W, X, Y, and Z components 
of the interaction. They are also referred to as the AB 2 C 2 , AB 2 C, ABC 2 , and ABC com¬ 
ponents of the ABC interaction, respectively. The two notations are used interchangeably; 
that is, 

W(ABC) = AB 2 C 2 
X(ABC) = AB 2 C 
Y(ABC) = ABC 2 
Z(ABC) = ABC 

Note that no first letter can have an exponent other than 1. Like the I and J components, 
the W, X, Y, and Z components have no practical interpretation. They are, however, useful 
in constructing more complex designs. 

EXAMPLE 9-1 . 

A machine is used to fill 5-gallon metal containers with soft drink syrup. The variable 
of interest is the amount of syrup loss due to frothing. Three factors are thought to 
influence frothing: the nozzle design (A), the filling speed (B), and the operating pressure 
(C). Three nozzles, three filling speeds, and three pressures are chosen and two replicates 
of a 3 3 factorial experiment are run. The coded data are shown in Table 9-1 on page 368. 

The analysis of variance for the syrup loss data is shown in Table 9-2 on page 368. 
The sums of squares have been computed by the usual methods. We see that the filling 
speed and operating pressure are statistically significant. All three two-factor interactions 
are also significant. The two-factor interactions are analyzed graphically in Figure 9-4 
on page 369. The middle level of speed gives the best performance, nozzle types 2 and 
3, and either the low (10 psi) or high (20 psi) pressure seem most effective in reducing 
syrup loss. 
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Table 9-1 Syrup Loss Data for Example 9-1 (units are cubic centimeters - 70) 


Pressure (in psi) 
(C) 




Nozzle Type (A) 





1 



2 



3 





Speed (in RPM) (5) 




100 

120 

140 

100 

120 

140 

100 

120 

140 

10 

-35 

-45 

-40 

17 

-65 

20 

-39 

-55 

15 


-25 

-60 

15 

24 

-58 

4 

-35 

-67 

-30 

15 

no 

-10 

80 

55 

-55 

110 

90 

-28 

110 


75 

30 

54 

120 

-44 

44 

113 

-26 

135 

20 

4 

-40 

31 

-23 

-64 

-20 

-30 

-61 

54 


5 

-30 

36 

-5 

-62 

-31 

-55 

-52 

4 


Example 9-1 illustrates a situation where the three-level design often finds some 
application; one or more of the factors is qualitative, naturally taking on three levels, 
and the remaining factors are quantitative. In this example, suppose that there are only 
three nozzle designs that are of interest. This is clearly, then, a qualitative factor that 
requires three levels. The filling speed and the operating pressure are quantitative factors. 
Therefore, we could fit a quadratic model such as Equation 9-1 in the two factors speed 
and pressure at each level of the nozzle factor. 

Table 9-3 (on the facing page) shows these quadratic regression models. The /3’s in 
these models were estimated using a standard linear regression computer program. (We 
will discuss least squares regression in more detail in Chapter 10.) In these models, the 
variables x t and x 2 are coded to the levels -1, 0, +1 as discussed previously, and we 
assumed the following natural levels for pressure and speed: 



Coded Level 

Speed (psi) 

Pressure (rpm) 




-1 

100 

10 




0 

120 

15 




+ 1 

140 

20 



Table 9-3 presents models both in terms of these 

coded variables and in terms of the 

natural levels of speed and pressure. 




Table 9-2 

Analysis of Variance for Syrup Loss Data 




Source of 

Sum of 

Degrees of 

Mean 



Variation 

Squares 

Freedom 

Square 

F 0 

P-Value 

A, nozzle 

993.77 

2 

496.89 

1.17 

0.3256 

B, speed 

61,190.33 

2 

30,595.17 

71.74 

<0.0001 

C, pressure 

69,105.33 

2 

34,552.67 

81.01 

<0.0001 

AB 

6,300.90 

4 

1,575.22 

3.69 

0.0383 

AC 

7,513.90 

4 

1,878.47 

4.40 

0.0222 

BC 

12,854.34 

4 

3,213.58 

7.53 

0.0025 

ABC 

4,628.76 

8 

578.60 

1.36 

0.2737 

Error 

11,515.50 

27 

426.50 



Total 

174,102.83 

53 
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Speed in rpm IB) 
(c) 

Figure 9-4 Two-factor interactions for Example 9-1. 


Table 9-3 Regression Models for Example 9-1 

Nozzle Type _ x, = Speed (S), x 2 = Pressure (P) in Coded Units _ 

j y = 22.1 + 3.5x, + 16.3x 2 + 51.7x? - 71.8x| + 2.9x,x 2 

_ y = 1217.3 - 31.2565 + 86.017P + 0.12917S 2 - 2.8733P 2 + 0.02815SP 

2 y = 25.6 - 22.8x, - 12.3x 2 + 14. lx? - 56.9x| - 0.7x,x 2 

_ y = 180.1 - 9.4755 + 66.75P + 0.035.S’ 2 - 2.2161P 2 - 0.0075SP 

3 y = 15.1 + 20.3x, + 5.9x 2 + 75.8x? - 94.9x? + 10.5x,x 2 

y = 1940.1 - 40.0585 + 102.48P + 0.189585 2 - 3.7967.P 2 + 0.105S/> 
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Figure 9-5 shows the response surface contour plots of constant syrup loss as a 
function of speed and pressure for each nozzle type. These plots reveal considerable 
useful information about the performance of this filling system. Because the objective is 
to minimize syrup loss, nozzle type 3 would be preferred as the smallest observed con¬ 
tours (—60) appear only on this plot. Filling speed near the middle level of 120 rpm and 
either the low or high pressure levels should be used. 

When constructing contour plots for an experiment that has a mixture of quantitative 
and qualitative factors, it is not unusual to find that the shapes of the surfaces in the 
quantitative factors are very different at each level of the qualitative factors. This is 
noticeable to some degree in Figure 9-5, where the shape of the surface for nozzle type 
2 is considerably elongated in comparison to the surfaces for nozzle types 1 and 3. When 
this occurs, it implies that the optimum operating conditions (and other important con¬ 
clusions) in terms of the quantitative factors are very different at each level of the qual¬ 
itative factors. 

We can easily show the numerical partitioning of the ABC interaction into its four 
orthogonal two-degree-of-freedom components using the data in Example 9-1. The gen¬ 
eral procedure has been described by Cochran and Cox (1957) and Davies (1956). First, 
select any two of the three factors, say AB, and compute the / and J totals of the AB 
interaction at each level of the third factor C. These calculations follow: 


-60 

-105 

-25 


-74 

-122 

-15 


-106 

-155 


-222 

-79 

-158 


-144 

-40 


The I(AB ) and J(AB ) totals are now arranged in a two-way table with factor C, and the 
I and / diagonal totals of this new display are computed: 


c 


I(AB) 


Totals 

/ 

J 

C 


J(AB) 


Totals 

I J 

10 

-198 

-106 

-155 

-149 

41 

10 

-222 

-79 

-158 

63 138 

15 

331 

255 

377 

212 

19 

15 

238 

440 

285 

62 4 

20 

-59 

-74 

-206 

102 

105 

20 

-144 

-40 

-155 

40 23 


The I and J diagonal totals computed above are actually the totals representing the 
quantities I[I(AB) X C] = AB 2 C 2 , J[I(AB) X C] = AB 2 C, I\J(AB) X C] = ABC 2 , and 
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J[J(AB) X C] = ABC, or the W, X, Y, and Z components of ABC. The sums of squares 
are found in the usual way; that is, 

I[l(AB) X C)] = AB 2 C 2 = W(ABC) 

= (—149) 2 + (212) 2 + (102) 2 _ (I65f = 

18 54 

J[I(AB) X C] = AB 2 C = X(ABC) 

= (41) 2 + (19) 2 + (105) 2 _ (165 f = 

18 54 

f[J(AB) X C] = ABC 2 = Y(ABC) 

= (63) 2 + (62) 2 + (40) 2 _ (I65f = 

18 54 

J[J(AB X C] = ABC = Z(ABC) 

= (138) 2 + (4) 2 + (23) 2 _ (165)^ = 584 
18 54 

Although this is an orthogonal partitioning of SS ABC , we point out again that it is not 
customarily displayed in the analysis of variance table. In subsequent sections, we discuss 
the occasional need for the computation of one or more of these components. 

9-1.4 The General 3 k Design 

The concepts utilized in the 3 2 and 3 3 designs can be readily extended to the case of k 
factors, each at three levels, that is, to a 3* factorial design. The usual digital notation is 
employed for the treatment combinations, so 0120 represents a treatment combination in 
a 3 4 design with A and D at the low levels, B at the intermediate level, and C at the high 
level. There are 3 k treatment combinations, with 3* - 1 degrees of freedom between 
them. These treatment combinations allow sums of squares to be determined for k main 
effects, each with two degrees of freedom; (£) two-factor interactions, each with four 
degrees of freedom;...; and one ^-factor interaction with 2 k degrees of freedom. In 
general, an /(-factor interaction has 2 h degrees of freedom. If there are n replicates, there 
are n3 k — 1 total degrees of freedom and 3 k (n — 1) degrees of freedom for error. 

Sums of squares for effects and interactions are computed by the usual methods for 
factorial designs. Typically, three-factor and higher interactions are not broken down any 
further. However, any /(-factor interaction has 2 h 1 orthogonal two-degrees-of-freedom 
components. For example, the four-factor interaction ABCD has 2 4_1 = 8 orthogonal 
two-degrees-of-freedom components, denoted by ABCD 2 , ABC 2 D, AB 2 CD, ABCD, 
ABC 2 D 2 , AB 2 C 2 D, AB 2 CD 2 , and AB 2 C 2 D 2 . In writing these components, note that the 
only exponent allowed on the first letter is 1. If the exponent on the first letter is not 1, 
then the entire expression must be squared and the exponents reduced modulus 3. To 
demonstrate, consider 

A 2 BCD = ( A 2 BCDf = A 4 B 2 C 2 D 2 = AB 2 C 2 D 2 
These interaction components have no physical interpretation, but they are useful in 
constructing more complex designs. 

The size of the design increases rapidly with k. For example, a 3 3 design has 27 
treatment combinations per replication, a 3 4 design has 81, a 3 5 design has 243, and so 
on. Therefore, frequently only a single replicate of the 3* design is considered, and higher- 
order interactions are combined to provide an estimate of error. As an illustration, if 
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three-factor and higher interactions are negligible, then a single replicate of the 3 3 design 
provides 8 degrees of freedom for error, and a single replicate of the 3 4 design provides 
48 degrees of freedom for error. These are still large designs for k 2= 3 factors, and, 
consequently, not too useful. 

9-2 CONFOUNDING IN THE 3 fc FACTORIAL DESIGN 

Even when a single replicate of the 3* design is considered, the design requires so many 
runs that it is unlikely that all 3* runs can be made under uniform conditions. Thus, 
confounding in blocks is often necessary. The 3* design may be confounded in 3 P in¬ 
complete blocks, where p < k. Thus, these designs may be confounded in three blocks, 
nine blocks, and so on. 

9-2.1 The 3* Factorial Design in Three Blocks 

Suppose that we wish to confound the 3* design in three incomplete blocks. These three 
blocks have two degrees of freedom among them; thus, there must be two degrees of 
freedom confounded with blocks. Recall that in the 3 k factorial series each main effect 
has two degrees of freedom. Furthermore, every two-factor interaction has four degrees 
of freedom and can be decomposed into two components of interaction (e.g., AB and 
AB 2 ), each with two degrees of freedom; every three-factor interaction has eight degrees 
of freedom and can be decomposed into four components of interaction (e.g .,ABC, ABC 2 , 
AB 2 C, and AB 2 C 2 ), each with two degrees of freedom; and so on. Therefore, it is con¬ 
venient to confound a component of interaction with blocks. 

The general procedure is to construct a defining contrast 

L = + a 2 x 2 + • • ■ + a k x k (9-2) 

where a, represents the exponent on the ith factor in the effect to be confounded and x, 
is the level of the zth factor in a particular treatment combination. For the 3* series, we 
have a, = 0, 1, or 2 with the first nonzero a, being unity, and x t = 0 (low level), 1 
(intermediate level), or 2 (high level). The treatment combinations in the 3 k design are 
assigned to blocks based on the value of L (mod 3). Because L (mod 3) can take on only 
the values 0, 1, or 2, three blocks are uniquely defined. The treatment combinations 
satisfying L = 0 (mod 3) constitute the principal block. This block will always contain 
the treatment combination 00 ... 0. 

For example, suppose we wish to construct a 3 2 factorial design in three blocks. 
Either component of the AB interaction, AB or AB 2 , may be confounded with blocks. 
Arbitrarily choosing AB 2 , we obtain the defining contrast 

L = x x + 2 x 2 

The value of L (mod 3) of each treatment combination may be found as follows: 

00: L = 1(0) + 2(0) = 0 = 0 (mod 3) 11: L = 1(1) + 2(1) = 3 = 0 (mod 3) 

01: L = 1(0) + 2(1) = 2 = 2 (mod 3) 21: L = 1(2) + 2(1) = 4=1 (mod 3) 

02: L = 1(0) + 2(2) = 4=1 (mod 3) 12: L = 1(1) + 2(2) = 5 = 2 (mod 3) 

10: L = 1(1) + 2(0) =1 = 1 (mod 3) 22: L = 1(2) + 2(2) = 6 = 0 (mod 3) 

20: L = 1(2) + 2(0) = 2 = 2 (mod 3) 

The blocks are shown in Figure 9-6 on the next page. 
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Block 1 


00 

11 

22 


Block 2 


10 

21 

02 


Block: 


01 

12 

20 


(a) Assignment of the treatment 
combinations to blocks 


00 10 


• = Block 1 
O - Block 2 
a = Block 3 


1_I_I_I_ 

0 1 2 
Factor A 

(6) Geometric view 

Figure 9-6 The 3 2 design in three blocks with AB 2 
confounded. 


The elements in the principal block form a group with respect to addition modulus 
3. Referring to Figure 9-6, we see that 11 + 11 = 22, and 11 + 22 = 00. Treatment 
combinations in the other two blocks may be generated by adding, modulus 3, any 
element in the new block to the elements of the principal block. Thus, for block 2 we 
use 10 and obtain 

10 + 00 = 10 10 + 11 = 21 and 10 + 22 = 02 

To generate block 3, using 01 we find 

01 + 00 = 01 01 + 11 = 12 and 01 + 22 = 20 

EXAMPLE 9-2 . 

We illustrate the statistical analysis of the 3 2 design confounded in three blocks by using 
the following data, which come from the single replicate of the 3 2 design shown in Figure 
9-6. 


Block Totals = 


Block 1 


Block 2 Block 3 


00 = 4 

11 = -4 
22 = 0 


10 = -2 
21 = 1 
02 = 8 


01 = 5 

12 = -5 
20 = 0 


0 7 0 


Using conventional methods for the analysis of factorials, we find that SS A = 131.56 and 
SS B = 0.22. 
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Table 9-4 Analysis of Variance for Data 
in Example 9-2 


Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Blocks (AB 2 ) 

10.89 

2 

A 

131.56 

2 

B 

0.22 

2 

AB 

2.89 

2 

Total 

145.56 

8 


We also find that 


^Blocks — 


(0) 2 + (7) 2 + (0) 2 
3 


( 7 f 

9 


10.89 


However, SS BloC k S is exactly equal to the AB 2 component of interaction. To see this, write 
the observations as follows: 


Factor B 

_0_1_2_ 

0 4 5 8 

Factor A 1 —2 —4 —5 

2 0 10 


Recall from Section 9-1.2 that the / or AB 2 component of the AB interaction may be 
found by computing the sum of squares between the left-to-right diagonal totals in the 
above layout. This yields 


SS AB 2 


(0) 2 + (0) 2 + (7) 2 
3 


(Jf 

9 


10.89 


which is identical to SS Blocks . 

The analysis of variance is shown in Table 9-4. Because there is only one replicate, 
no formal tests can be performed. It is not a good idea to use the AB component of 
interaction as an estimate of error. 


We now look at a slightly more complicated design—a 3 3 factorial confounded in 
three blocks of nine runs each. The AB 2 C 2 component of the three-factor interaction will 
be confounded with blocks. The defining contrast is 

L = x, + 2 x 2 + 2x 3 

It is easy to verify that the treatment combinations 000, 012, and 101 belong in the 
principal block. The remaining runs in the principal block are generated as follows: 

(1) 000 (4) 101 + 101 - 202 (7) 101 + 021 = 122 

(2) 012 (5) 012 + 012 = 021 (8) 012 + 202 = 211 

(3) 101 (6) 101 + 012 = 110 (9) 021 + 202 = 220 
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To find the runs in another block, note that the treatment combination 200 is not in the 
principal block. Thus, the elements of block 2 are 

(1) 200 + 000 = 200 (4) 200 + 202 = 102 (7) 200 + 122 = 022 

(2) 200 + 012 = 212 (5) 200 + 021 = 221 (8) 200 + 211 = 111 

(3) 200 + 101 = 001 (6) 200 + 110 = 010 (9) 200 + 220 = 120 

Notice that these runs all satisfy L = 2 (mod 3). The final block is found by observing 
that 100 does not belong in block 1 or 2. Using 100 as above yields 

(1) 100 + 000 = 100 (4) 100 + 202 = 002 (7) 100 + 122 = 222 

(2) 100 + 012 = 112 (5) 100 + 021 = 121 (8) 100 + 211 = 011 

(3) 100 + 101 = 201 (6) 100 + 110 = 210 (9) 100 + 220 = 020 

The blocks are shown in Figure 9-7. 
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Table 9-5 Analysis of Variance for a 3 3 Design 
with AB 2 C 2 Confounded 


Source of Variation 

Degrees of 
Freedom 

Blocks (AB 2 C 2 ) 

2 

A 

2 

B 

2 

C 

2 

AB 

4 

AC 

4 

BC 

4 

Error (ABC + AB 2 C + ABC 2 ) 

6 

Total 

26 


The analysis of variance for this design is shown in Table 9-5. Using this confound¬ 
ing scheme, information on all the main effects and two-factor interactions is available. 
The remaining components of the three-factor interaction (ABC, AB 2 C, and ABC 2 ) are 
combined as an estimate of error. The sum of squares for those three components could 
be obtained by subtraction. In general, for the 3* design in three blocks, we would always 
select a component of the highest-order interaction to confound with blocks. The re¬ 
maining unconfounded components of this interaction could be obtained by computing 
the ^-factor interaction in the usual way and subtracting from this quantity the sum of 
squares for blocks. 


9-2.2 The 3 k Factorial Design in Nine Blocks 

In some experimental situations it may be necessary to confound the 3* design in nine 
blocks. Thus, eight degrees of freedom will be confounded with blocks. To construct 

these designs, we choose two components of interaction and, as a result, two more will 

be confounded automatically, yielding the required eight degrees of freedom. These two 
are the generalized interactions of the two effects originally chosen. In the 3 k system, the 
generalized interactions of two effects (e.g., P and Q) are defined as PQ and PQ 2 (or 

P 2 Q). 

The two components of interaction initially chosen yield two defining contrasts 

Li = a x x x + a 2 x 2 + • • • + a k x k = u (mod 3) u = 0, 1, 2 ( 9 _ 3 ) 

L 2 = 0,*, + 0 2 * 2 + ■ • • + fi k x k = h (mod 3) h = 0, 1, 2 

where {a,} and {0,} are the exponents in the first and second generalized interactions, 
respectively, with the convention that the first nonzero a, and 0, are unity. The defining 
contrasts in Equation 9-3 imply nine simultaneous equations specified by the pair of 
values for L l and L 2 . Treatment combinations having the same pair of values for (L x , L 2 ) 
are assigned to the same block. 

The principal block consists of treatment combinations satisfying L } = L 2 = 0 (mod 
3). The elements of this block form a group with respect to addition modulus 3; thus, 
the scheme given in Section 9-2.1 can be used to generate the blocks. 
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As an example, consider the 3 4 factorial design confounded in nine blocks of nine 
runs each. Suppose we choose to confound ABC and AB 2 D 2 . Their generalized 
interactions 


(ABC)(AB 2 D 2 ) = A 2 B 3 CD 2 = (A 2 B 3 CD 2 ) 2 = AC 2 D 
(ABC)(AB 2 D 2 ) 2 = A 3 B s CD 4 = B 2 CD = C B 2 CD ) 2 = BC 2 D 2 

are also confounded with blocks. The defining contrasts for ABC and AB 2 D 2 are 

Lj = + x 2 + x 3 (9-4) 

L 2 — Xi + 2x 2 + 2*4 

The nine blocks may be constructed by using the defining contrasts (Equation 9-4) 
and the group-theoretic property of the principal block. The design is shown in Figure 
9-8. 

For the 3* design in nine blocks, there will be four components of interaction con¬ 
founded. The remaining unconfounded components of these interactions can be deter¬ 
mined by subtracting the sum of squares for the confounded component from the sum 
of squares for the entire interaction. The method described in Section 9-1.3 may be useful 
in computing the components of interaction. 


9*2.3 The 3 fc Factorial Design in 3 P Blocks 

The 3* factorial design may be confounded in 3 P blocks of 3 k ~ p observations each, where 
p < k. The procedure is to select p independent effects to be confounded with blocks. 
As a result, exactly (3 P - 2 p — l)/2 other effects are automatically confounded. These 
effects are the generalized interactions of those effects originally chosen. 

As an illustration, consider a 3 7 design to be confounded in 27 blocks. Because p = 
3, we would select three independent components of interaction and automatically con¬ 
found [3 3 - 2(3) — l]/2 = 10 others. Suppose we choose ABC 2 DG, BCE 2 F 2 G, and 
BDEFG. Three defining contrasts can be constructed from these effects, and the 27 blocks 


Block 2 Block 3 


Block 5 Block 6 Block 7 Block 8 Block 9 


(L 1 ,L 2 ) = (0,0) (0,1) 

Figure 9-8 The 3 4 design in 


2,2) (2,0) (2,1) (1,2) (1,1) (1,0) 
le blocks with ABC , AB 2 D 2 , AC 2 D, and BC 2 D 2 confounded. 
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can be generated by the methods previously described. The other 10 effects confounded 
with blocks are 

(ABC 2 DG)(BCE 2 F 2 G) = AB 2 DE 2 F 2 G 2 
(ABC 2 DG)(BCE 2 F 2 G) 2 = AB 3 C 4 DE 4 F 4 G 3 = ACDEF 
(ABC 2 DG)(BDEFG) = AB 2 C 2 D 2 EFG 2 
(ABC 2 DG)(BDEFG) 2 = AB 3 C 2 D 3 E 2 F 2 G 3 = AC 2 E 2 F 2 
(BCE 2 F 2 G)(BDEFG) = B 2 CDE 3 F 3 G 2 = BC 2 D 2 G 
{BCE 2 F 2 G){BDEFGf = B 3 CD 2 E 4 F 4 G 3 = CD 2 EF 
(ABC 2 DG)(BCE 2 F 2 G)(BDEFG) = AB 3 C 3 D 2 E 3 F 3 G 3 = AD 2 
(ABC 2 DG)\BCE 2 F 2 G)(BDEFG ) = a 2 b 4 c 5 d 3 g 4 = ab 2 cg 2 
(ABC 2 DG)(BCE 2 F 2 G) 2 (BDEFG) = abcd 2 e 2 f 2 g 
(ABC 2 DG)(BCE 2 F 2 G)(BDEFG) 2 = ABC 3 D 3 E 4 F 4 G 4 = ABEFG 

This is a huge design requiring 3 7 = 2187 observations arranged in 27 blocks of 81 
observations each. 


9-3 FRACTIONAL REPLICATION OF THE 3 k FACTORIAL DESIGN 

The concept of fractional replication can be extended to the 3* factorial designs. Because 
a complete replicate of the 3* design can require a rather large number of runs even for 
moderate values of k, fractional replication of these designs is of interest. As we shall 
see, however, some of these designs have unattractive alias structures. 


9-3.1 The One-Third Fraction of the 3 k Factorial Design 

The largest fraction of the 3 k design is a one-third fraction containing 3*~‘ runs. Con¬ 
sequently, we refer to this as a 3* _1 fractional factorial design. To construct a 3 A-1 
fractional factorial design select a two-degrees-of-freedom component of interaction 
(generally, the highest-order interaction) and partition the full 3 k design into three blocks. 
Each of the three resulting blocks is a 3 A_1 fractional design, and any one of the blocks 
may be selected for use. If AB“ 2 C“ 3 • • • K ak is the component of interaction used to define 
the blocks, then I = AB“ 2 C“ 3 • • • K°“ k is called the defining relation of the fractional 
factorial design. Each main effect or component of interaction estimated from the 3 A_I 
design has two aliases, which may be found by multiplying the effect by both / and / 2 
modulus 3. 

As an example, consider a one-th ir d fraction of the 3 3 design. We may select any 
component of the ABC interaction to construct the design, that is, ABC, AB 2 C, ABC 2 , or 
AB 2 C 2 . Thus, there are actually 12 different one-th ir d fractions of the 3 3 design defined 
by 

x i + a 2 x 2 + a 3 x 3 = u (mod 3) 

where a = 1 or 2 and u = 0,1, or 2. Suppose we select the component of AB 2 C 2 . Each 
fraction of the resulting 3 3 1 design will contain exactly 3 2 = 9 treatment combinations 
that must satisfy 

x l + 2x 2 + 2x 3 = u 


(mod 3) 
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where u = 0, 1, or 2. It is easy to verify that the three one-third fractions are as shown 
in Figure 9-9. 

If any one of the 3 3-1 designs in Figure 9-9 is run, the resulting alias structure is 

A = A(AB 2 C 2 ) = A 2 B 2 C 2 = ABC 
A = A(AB 2 C 2 ) 2 = A 3 B 4 C 4 = BC 
B = B(AB 2 C 2 ) = AB 3 C 2 = AC 2 
B = B(AB 2 C 2 ) 2 = A 2 B 5 C 4 = ABC 2 
C = C(AB 2 C 2 ) = AB 2 C 3 = AB 2 
C = C(AB 2 C 2 ) 2 = A 2 B 4 C 5 = AB 2 C 
AB = AB(AB 2 C 2 ) = A 2 B 3 C 2 = AC 
AB = AB(AB 2 C 2 ) 2 = A 3 B 5 C 4 = BC 2 


Consequently, the four effects that are actually estimated from the eight degrees of 
freedom in the design are A + BC + ABC, B + AC 2 + ABC 2 , C + AB 2 + AB 2 C, and 
AB + AC + BC 2 . This design would be of practical value only if all the interactions 
were small relative to the main effects. Because the main effects are aliased with two- 
factor interactions, this is a resolution III design. Notice how complex the alias relation¬ 
ships are in this design. Each main effect is aliased with a component of interaction. If, 
for example, the two-factor interaction BC is large, this will potentially distort the esti¬ 
mate of the main effect of A and make the AB + AC + BC 2 effect very difficult to 
interpret. It is very difficult to see how this design could be useful unless we assume that 
all interactions are negligible. 


Design 1 Design 1 Design 1 




(6) Geometric view 

Figure 9-9 The three one-third fractions of the 3 3 design with defining relation / = AB 2 C 2 . 
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Before leaving the 3 3 n 1 design, note that for the design with u = 0 (see Figure 9-9) 
if we let A denote the row and B denote the column, then the design can be written as 

000 012 021 

101 110 122 

202 211 220 

which is a 3 X 3 Latin square. The assumption of negligible interactions required for 
unique interpretations of the 3 3 n 1 design is paralleled in the Latin square design. How¬ 
ever, the two designs arise from different motives, one as a consequence of fractional 
replication and the other from randomization restrictions. From Table 4-13 we observe 
that there are only twelve 3X3 Latin squares and that each one corresponds to one of 
the twelve different 3 3_1 fractional factorial designs. 

The treatment combinations in a 3*” 1 design with the defining relation 1 = 
AB a2 C a 3 • • ■ K" k can be constructed using a method similar to that employed in the 2 k ~ p 
series. First, write down the 3* _1 runs for a full three-level factorial design in k - 1 
factors, with the usual 0, 1, 2 notation. This is the basic design in the terminology of 
Chapter 8. Then introduce the Mi factor by equating its levels x k to the appropriate 
component of the highest-order interaction, say AB" 2 C“ 3 ■ ■ ■ (K - 1 through the 
relationship 

x k = PiX\ + p 2 x 2 + ■■■ + p k -\x k -i (9-5) 

where /3, = (3 - a*)a, (mod 3) for 1 *£ i k — 1. This yields a design of the highest 
possible resolution. 

As an illustration, we use this method to generate the 3?v 1 design with the defining 
relation 7 = AB 2 CD shown in Table 9-6. It is easy to verify that the first three digits of 
each treatment combination in this table are the 27 runs of a full 3 3 design. This is the 
basic design. For AB 2 CD, we have a x = a 3 = a 4 = 1 and a 2 = 2. This implies that 
P, = (3 — l)a t (mod 3) = (3 - 1)(1) = 2, p 2 = (3 - 1 )a 2 (mod 3) = (3 - 1)(2) = 
4 = 1 (mod 3), and p 3 = (3 - l)a 3 (mod 3) = (3 - 1)(1) = 2. Thus, Equation 9-5 
becomes 


x 4 = 2x\ + x 2 + 2x 3 (9-6) 

The levels of the fourth factor satisfy Equation 9-6. For example, we have 2(0) + 1(0) + 
2(0) = 0, 2(0) + 1(1) + 2(0) = 1, 2(1) + 1(1) + 2(0) = 3 = 0, and so on. 

The resulting 3?v 1 design has 26 degrees of freedom that may be used to compute 

Table 9-6 A 3Jv 1 Design with 
I = AB 2 CD 

0000 0012 2221 

0101 0110 0021 

1100 0211 0122 

1002 1011 0220 

0202 1112 1020 

1201 1210 1121 

2001 2010 1222 

2102 2111 2022 

2200 2212 2120 
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the sums of squares for the 13 main effects and components of interactions (and their 
aliases). The aliases of any effect are found in the usual manner; for example, the aliases 
of A are A{AB 2 CD) = ABC 2 D 2 and A(AB 2 CD) 2 = BC 2 D 2 . One may verify that the four 
main effects are clear of any two-factor interaction components, but that some two-factor 
interaction components are aliased with each other. Once again, we notice the complexity 
of the alias structure. If any two-factor interactions are large, it will likely be very difficult 
to isolate them with this design. 

The statistical analysis of a 3* _1 design is accomplished by the usual analysis of 
variance procedures for factorial experiments. The sums of squares for the components 
of interaction may be computed as in Section 9-1. Remember when interpreting results 
that the components of interactions have no practical interpretation. 


9-3.2 Other 3 k p Fractional Factorial Designs 

For moderate to large values of k, even further fractionation of the 3* design is potentially 
desirable. In general, we may construct a (If fraction of the 3 k design for p < k, where 
the fraction contains 3 k ~ p runs. Such a design is called a 3 k ~ p fractional factorial design. 
Thus, a 3 k ~ 2 design is a one-ninth fraction, a 3* -3 design is a one-twenty-seventh frac¬ 
tion, and so on. 

The procedure for constructing a 3 k p fractional factorial design is to select p com¬ 
ponents of interaction and use these effects to partition the 3* treatment combinations 
into 3 P blocks. Each block is then a 3 k ~ p fractional factorial design. The defining relation 
/ of any fraction consists of the p effects initially chosen and their (3 P - 2p - l)/2 
generalized interactions. The alias of any main effect or component of interaction is 
produced by multiplication modulus 3 of the effect by / and I 2 . 

We may also generate the runs defining a 3 k ~ p fractional factorial design by first 
writing down the treatment combinations of a full 3 k ~ p factorial design and then intro¬ 
ducing the additional p factors by equating them to components of interaction, as we did 
in Section 9-3.1. 

We illustrate the procedure by constructing a 3 4-2 design, that is, a one-ninth fraction 
of the 3 4 design. Let AB 2 C and BCD be the two components of interaction chosen to 
construct the design. Their generalized interactions are ( AB 2 C)(BCD ) = AC 2 D and 
(AB 2 C)(BCD) 2 = ABD 2 . Thus, the defining relation for this design is / = AB 2 C = 
BCD = AC 2 D = ABD 2 , and the design is of resolution III. The nine treatment combi¬ 
nations in the design are found by writing down a 3 2 design in the factors A and B, and 
then adding two new factors by setting 


Jt 3 = 2xi + x 2 
jc 4 = 2x 2 + 2x 3 


Table 9-7 A 3?n 2 Design 
with I = AB Z C 

_ and I = BCD 

0000 0111 0222 

1021 1102 1210 

2012 2120 2201 
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Table 9-8 Alias Structure for the 3^ 2 Design in Table 9-7 


Effect 





Aliases 







I 



1 

2 


A 

ABC 2 

ABCD 

ACD 2 

AB 2 D 

BC 2 

AB 2 C 2 D 2 

CD 2 

BD 2 

B 

AC 

BC 2 D 2 

ABC 2 D 

AB 2 D 2 

ABC 

CD 

AB 2 C 2 D 

AD 2 

C 

AB 2 C 2 

BC 2 D 

AD 

ABCD 2 

AB 2 

BD 

ACD 

ABC 2 D 2 

D 

AB 2 CD 

BCD 2 

AC 2 D 2 

AB 

AB 2 CD 2 

BC 

AC 2 

ABD 


This is equivalent to using AB 2 C and BCD to partition the full 3 4 design into nine blocks 
and then selecting one of these blocks as the desired fraction. The complete design is 
shown in Table 9-7 on the facing page. 

This design has eight degrees of freedom that may be used to estimate four main 
effects and their aliases. The aliases of any effect may be found by multiplying the effect 
modulus 3 by AB 2 C, BCD, AC 2 D, ABD 2 , and their squares. The complete alias structure 
for the design is given in Table 9-8. 

From the alias structure, we see that this design is useful only in the absence of 
interaction. Furthermore, if A denotes the rows and B denotes the columns, then from 
examining Table 9-7 we see that the 3fu 2 design is also a Graeco-Latin square. 

The publication by Connor and Zelen (1959) contains an extensive selection of 
designs for 4 =£ k =£ 10. This pamphlet was prepared for the National Bureau of Standards 
and is the most complete table of fractional 3 k ~ p plans available. 

In this section, we have noted several times the complexity of the alias relationships 
in 3 k ~ p fractional factorial designs. In general, if k is moderately large, say k > 4 or 5, 
the size of the 3 k design will drive most experimenters to consider fairly small fractions. 
Unfortunately, these designs have alias relationships that involve the partial aliasing of 
two-degrees-of-freedom components of interaction. This, in turn, results in a design that 
will be difficult if not impossible to interpret if interactions are not negligible. Further¬ 
more, there are no simple augmentation schemes (such as fold over) that can be used to 
combine two or more fractions to isolate significant interactions. The 3* design is often 
suggested as appropriate when curvature is present. However, there are more efficient 
alternatives (see Chapter 11). For these reasons, I feel that the 3 k ~ p fractional factorial 
designs are solutions looking for a problem—they are not generally good designs. 


9-4 FACTORIALS WITH MIXED LEVELS 

We have emphasized factorial and fractional factorial designs in which all the factors 
have the same number of levels. The two-level system discussed in Chapters 6, 7, and 8 
is particularly useful. The three-level system presented earlier in this chapter is much 
less useful because the designs are relatively large even for a modest number of factors, 
and most of the small fractions have complex alias relationships that would require very 
restrictive assumptions regarding interactions to be useful. 

It is our belief that the two-level factorial and fractional factorial designs should be 
the cornerstone of industrial experimentation for product and process development, trou¬ 
bleshooting, and improvement. There are, however, some situations in which it is nec¬ 
essary to include a factor (or a few factors) that has more than two levels. This usually 
occurs when there are both quantitative and qualitative factors in the experiment, and 
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Table 9-9 Use of Two-Level Factors to Form 

a Thr ee-Level Factor _ 

Two-Level Factors Three-L evel Factor 

B C X 


+ - *2 

— + *2 

+ + *3 


the qualitative factor has (say) three levels. If all factors are quantitative, then two-level 
designs with center points should be employed. In this section we show how some three- 
and four-level factors can be accommodated in a 2 k design. 


9-4.1 Factors at Two and Three Levels 

Designs in which some factors have two levels and other factors have three levels can 
be derived from the table of plus and minus signs for the usual 2 k design. The general 
procedure is best illustrated with an example. Suppose we have two variables, with A at 
two levels and X at three levels. Consider a table of plus and minus signs for the usual 
eight-run 2 3 design. The signs in columns B and C have the pattern shown on the left 
side of Table 9-9. Let the levels of X be represented by x u x 2 , and x 3 . The right side of 
Table 9-9 shows how the sign patterns for B and C are combined to form the levels of 
the three-level factor. 

Now factor X has two degrees of freedom and if the factor is quantitative it can be 
partitioned into a linear and a quadratic component, each component having one degree 
of freedom. Table 9-10 shows a 2 3 design with the columns labeled to show the actual 
effects that they estimate, with X L andX e denoting the linear and quadratic effects ofX, 
respectively. Note that the linear effect of X is the sum of the two effect estimates 
computed from the columns usually associated with B and C and that the effect of A can 
only be computed from the runs where X is at either the low or high levels, namely runs 
1, 2, 7, and 8. Similarly, the A X X L effect is the sum of the two effects that would be 
computed from the columns usually labeled AB and AC. Furthermore, note that runs 3 


Table 9-10 One 2-Level and One 3-Level Factor in a 2 3 Design 


Actual 

Treatment 

Combinations 

_ A _X_ 

Low Low 

High Low 

Low Med 

High Med 

Low Med 

High Med 

Low High 

High High 
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Table 9-11 Analysis of Variance for the Design in Table 9-10 


Source of Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

A 

SS A 

1 

MS a 

X(X L + X Q ) 

SS X 

2 

MS b 

AX (A X X L + A X X Q ) 

ss AX 

2 

MS ax 

Error (from runs 3 and 5 
and runs 4 and 6) 

SS E 

2 

MS e 

Total 

SS T 

7 



and 5 are replicates. Therefore, a one-degree-of-freedom estimate of error can be made 
using these two runs. Similarly, runs 4 and 6 are replicates, and this would lead to a 
second one-degree-of-freedom estimate of error. The average variance at these two pairs 
of runs could be used as a mean square for error with two degrees of freedom. The 
complete analysis of variance is summarized in Table 9-11. 

If we are willing to assume that the two-factor and higher interactions are negligible, 
we can convert the design in Table 9-10 into a resolution HI fraction with up to four 
two-level factors and a single three-level factor. This would be accomplished by asso¬ 
ciating the two-level factors with columns A, AB, AC, and ABC. Column BC cannot be 
used for a two-level factor because it contains the quadratic effect of the three-level 
factor X. 

This same procedure can be applied to the 16-, 32-, and 64-run 2* designs. For 16 
runs, it is possible to construct resolution V fractional factorials with two two-level factors 
and either two or three factors at three levels. A 16-run resolution V fraction can also be 
obtained with 3 two-level factors and one three-level factor. If we include four two-level 
factors and a single three-level factor in 16 runs, the design will be of resolution III. The 
32- and 64-run designs allow similar arrangements. For additional discussion of some of 
these designs, see Addleman (1962). 


9-4.2 Factors at Two and Four Levels 

It is very easy to accommodate a four-level factor in a 2* design. The procedure for doing 
this involves using two two-level factors to represent the four-level factor. For example, 
suppose that A is a four-level factor with levels a u a 2 , a 3 , and a 4 . Consider two columns 
of the usual table of plus and minus signs, say columns P and Q. The pattern of signs in 
these two columns is as shown on the left side of Table 9-12. The right side of this table 
shows how these four sign patterns would correspond to the four levels of factor A. The 

Table 9-12 Four-Level Factor A Expressed as 2 
Two-Level Factors 

Two-Level Factors Four-Level Factor 

Run P Q A 


a 2 
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effects represented by columns P and Q and the PQ interaction are mutually orthogonal 
and correspond to the three-degrees-of-freedom A effect. 

To illustrate this idea more completely, suppose that we have one four-level factor 
and two two-level factors and that we need to estimate all the main effects and interac¬ 
tions involving these factors. This can be done with a 16-run design. Table 9-13 (on the 
facing page) shows the usual table of plus and minus signs for the 16-run 2 4 design, with 
columns A and B used to form the four-level factor, say X, with levels x u x 2 , x 3 , and x 4 . 
Sums of squares would be calculated for each column A, B ,..., ABCD just as in the 
usual 2 k system. Then the sums of squares for all factors X, C, D, and their interactions 
are formed as follows: 


SS X = SS A + SS B + SS AB 
SSc = SSc 
SS D = SS D 
SS CD = SScd 

SS X c = SS AC + SS BC + SS ABC 
SS XD = SS AD + SS B d + SS AB p 
SSxCD = SS AC D + SS B CD + SS ABC D 


(3 degrees of freedom) 
(1 degree of freedom) 
(1 degree of freedom) 
(1 degree of freedom) 
(3 degrees of freedom) 
(3 degrees of freedom) 
(3 degrees of freedom) 


This could be called a 4 X 2 2 design. If we are willing to ignore two-factor interactions, 
up to nine additional two-level factors can be associated with the two-factor interaction 
(except AB), three-factor interaction, and four-factor interaction columns. 


9-5 PROBLEMS - 

9-1. The effects of developer strength (A) and development time (B) on the density of pho¬ 
tographic plate film are being studied. Three strengths and three times are used, and four 
replicates of a 3 2 factorial experiment are ran. The data from this experiment follow. 
Analyze the data using the standard methods for factorial experiments. 


Developer Strength 


Development Time (minutes) 



10 

14 



18 

1 

0 

2 

1 

3 

2 

5 


5 

4 

4 

2 

4 

6 

2 

4 

6 

6 

8 

9 

10 


7 

5 

7 

7 

8 

5 

3 

7 

10 

10 

10 

12 

10 


8 

7 

8 

7 

9 

8 


9-2. Compute the I and J components of the two-factor interaction in Problem 9-1. 

9-3. An experiment was performed to study the effect of three different types of 32-ounce 
bottles (A) and three different shelf types (B )—smooth permanent shelves, end-aisle dis¬ 
plays with grilled shelves, and beverage coolers—on the time it takes to stock ten 12- 
bottle cases on the shelves. Three workers (factor C ) were employed in the experiment, 
and two replicates of a 3 3 factorial design were run. The observed time data are shown in 
the following table. Analyze the data and draw conclusions. 


388 CHAPTER 9 3-LEVEL, MIXED-LEVEL, AND FRACTIONAL FACTORIAL DESIGNS 


Worker 

Bottle Type 


Replicate I 



Replicate II 


Permanent 

End Aisle 

Cooler 

Permanent End Aisle 

Cooler 


Plastic 

3.45 

4.14 

5.80 

3.36 

4.19 

5.23 

1 

28-mm glass 

4.07 

4.38 

5.48 

3.52 

4.26 

4.85 


38-mm glass 

4.20 

4.26 

5.67 

3.68 

4.37 

5.58 


Plastic 

4.80 

5.22 

6.21 

4.40 

4.70 

5.88 

2 

28-mm glass 

4.52 

5.15 

6.25 

4.44 

4.65 

6.20 


38-mm glass 

4.96 

5.17 

6.03 

4.39 

4.75 

6.38 


Plastic 

4.08 

3.94 

5.14 

3.65 

4.08 

4.49 

3 

28-mm glass 

4.30 

4.53 

4.99 

4.04 

4.08 

4.59 


38-mm glass 

4.17 

4.86 

4.85 

3.88 

4.48 

4.90 


9-4. A medical researcher is studying the effect of lidocaine on the enzyme level in the heart 
muscle of beagle dogs. Three different commercial brands of lidocaine (A), three dosage 
levels (B), and three dogs (C) are used in the experiment, and two replicates of a 3 3 
factorial design are run. The observed enzyme levels follow. Analyze the data from this 
experiment. 


Lidocaine 

Brand 

Dosage 

Strength 


Replicate I 



Replicate II 



Dog 



Dog 


1 

2 

3 

1 

2 

3 


1 

96 

84 

85 

84 

85 

86 

1 

2 

94 

99 

98 

95 

97 

90 


3 

101 

106 

98 

105 

104 

103 


1 

85 

84 

86 

80 

82 

84 

2 

2 

95 

98 

97 

93 

99 

95 


3 

108 

114 

109 

110 

102 

100 


1 

84 

83 

81 

83 

80 

79 

3 

2 

95 

97 

93 

92 

96 

93 


3 

105 

100 

106 

102 

111 

108 


9-5. Compute the / and J components of the two-factor interactions for Example 10-1. 

9-6. An experiment is run in a chemical process using a 3 2 factorial design. The design factors 

are temperature and pressure, and the response variable is yield. The data that result from 
this experiment are shown below: 


Temperature, °C 

100 

Pressure, psig 

120 

140 

80 

47.58, 48.77 

64.97, 69.22 

80.92, 72.60 

90 

51.86, 82.43 

88.47, 84.23 

93.95, 88.54 

100 

71.18, 92.77 

96.57, 88.72 

76.58, 83.04 


(a) Analyze the data from this experiment by conducting an analysis of variance. What 
conclusions can you draw? 

(b) Graphically analyze the residuals. Are there any concerns about underlying assump¬ 
tions or model adequacy? 


9-5 PROBLEMS 389 

(c) Verify that if we let the low, medium, and high levels of both factors in this design 
take on the levels —1, 0, and +1, then a least squares fit to a second-order model 
for yield is 

y = 86.81 + 10.4jc, + 8.42x 2 - - 7.86x1 “ 7.69x,x 2 

(d) Confirm that the model in part (c) can be written in terms of the natural variables 
temperature (T) and pressure (P) as 

y = -1335.63 + 18.567- + 8.59P - 0.0727’ 2 - 0.0196P 2 - 0.03847P 

(e) Construct a contour plot for yield as a function of pressure and temperature. Based 
on examination of this plot, where would you recommend running this process? 

9-7. (a) Confound a 3 3 design in three blocks using the ABC 2 component of the three-factor 

interaction. Compare your results with the design in Figure 9-7. 

(b) Confound a 3 3 design in three blocks using the AB 2 C component of the three-factor 
interaction. Compare your results with the design in Figure 9-7. 

(c) Confound a 3 3 design in three blocks using the ABC component of the three-factor 
interaction. Compare your results with the design in Figure 9-7. 

(d) After looking at the designs in parts (a), (b), and (c) and Figure 9-7, what conclusions 
can you draw? 

9-8. Confound a 3 4 design in three blocks using the AB 2 CD component of the four-factor 
interaction. 

9-9. Consider the data from the first replicate of Problem 9-3. Assuming that all 27 observations 

could not be run on the same day, set up a design for conducting the experiment over 
three days with AB 2 C confounded with blocks. Analyze the data. 

9-10. Outline the analysis of variance table for the 3 4 design in nine blocks. Is this a practical 
design? 

9-11. Consider the data in Problem 9-3. If ABC is confounded in replicate I and ABC 2 is con¬ 
founded in replicate II, perform the analysis of variance. 

9-12. Consider the data from replicate I of Problem 9-3. Suppose that only a one-third fraction 
of this design with / = ABC is run. Construct the design, determine the alias structure, 
and analyze the data. 

9-13. From examining Figure 9-9, what type of design would remain if after completing the 
first nine runs, one of the three factors could be dropped? 

9-14. Construct a 3fv 1 design with I = ABCD. Write out the alias structure for this design. 
9-15. Verify that the design in Problem 9-14 is a resolution IV design. 

9-16. Construct a 3 5 ” 2 design with / = ABC and / = CDE. Write out the alias structure for this 
design. What is the resolution of this design? 

9-17. Construct a 3 9-6 design, and verify that it is a resolution III design. 

9-18. Construct a 4 X 2 3 design confounded in two blocks of 16 observations each. Outline the 
analysis of variance for this design. 

9-19. Outline the analysis of variance table for a 2 2 3 2 factorial design. Discuss how this design 
may be confounded in blocks. 

9-20. Starting with a 16-run 2 4 design, show how two three-level factors can be incorporated 
in this experiment. How many two-level factors can be included if we want some infor¬ 
mation on two-factor interactions? 

9-21. Starting with a 16-run 2 4 design, show how one three-level factor and three two-level 
factors can be accommodated and still allow the estimation of two-factor interactions. 
9-22. In Problem 8-26, you met Harry and Judy Peterson-Nedry, two friends of the author who 
have a winery and vineyard in Newberg, Oregon. That problem described the application 
of two-level fractional factorial designs to their 1985 Pinot Noir product. In 1987, they 
wanted to conduct another Pinot Noir experiment. The variables for this experiment were 
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Variable 

Clone of Pinot Noir 
Berry size 

Fermentation temperature 
Whole berry 
Maceration time 
Yeast type 
Oak type 


Levels 

Wadenswil, Pommard 
Small, large 

80°F, 85°F, 90/80°F, 90°F 
None, 10% 

10 days, 21 days 
Assmanhau, Champagne 
Tron9ais, Allier 


Harry and Judy decided to use a 16-run two-level fractional factorial design, treating the 
four levels of fermentation temperature as two two-level variables. As in Problem 8-26, 
they used the rankings from a taste-test panel as the response variable. The design and 
the resulting average ranks are shown below: 




Berry 

Ferm. 

Whole 

Macer. 

Yeast 

Oak 

Average 

Run 

Clone 

Size 

Temp. 

Berry 

Time 

Type 

Type 

Rank 

1 

2 

+ 

- 

- 

- 

- 

+ 

+ 

+ 

4 

10 

3 

- 

+ 

- 

- 

+ 

- 

+ 

+ 

6 

4 

+ 

+ 

- 

- 

+ 

+ 

- 

- 

9 

5 

- 

- 

+ 

- 

+ 

+ 

+ 

- 

11 

6 

+ 

- 

+ 

- 

+ 

- 

- 

+ 

1 

7 

- 

+ 

+ 

- 

- 

+ 

- 

+ 

15 

8 

+ 

+ 

+ 

- 

- 

- 

+ 

- 

5 

9 

- 

- 

- 

+ 

+ 

+ 

- 

+ 

12 

10 

+ 

- 

- 

+ 

-i- 

- 

+ 

- 

2 

11 

- 

+ 

- 

+ 

- 

+ 

+ 

- 

16 

12 

+ 

+ 

- 

+ 

- 

- 

- 

+ 

3 

13 

- 

- 

+ 

+ 

- 

- 

+ 

+ 

8 

14 

+ 

- 

+ 

+ 

- 

+ 

- 

- 

14 

15 

- 

+ 

+ 

+ 

+ 

- 

- 

- 

7 

16 

+ 

+ 

+ 

+ 

+ 

+ 

+ 

+ 

13 

(a) 

Describe the aliasing in this design. 





(b) 

Analyze the data and draw conclusions. 





(c) 

What comparisons can you make between this experiment and the 1985 Pinot Noir 


experiment from Problem 8-26? 






An article by W. D. Baten in the 1956 volume of Industrial Quality Control described an 
experiment to study the effect of three factors on the lengths of steel bars. Each bar was 

subjected to one of two heat treatment processes and was cut on om 

s of four machines at 

one 

of three times during the day (8 a.m., 11 a.m. 

, or 3 p.m.). The coded length data are 

shown below: 

























Day 

Process 



1 

2 


3 

4 





6 

9 

7 9 

1 

2 

6 6 



1 


1 

3 

5 5 

0 

4 

7 3 

8 a.m. 



4 

6 

6 5 

-1 

0 

4 5 


0 1 3 4 0 1 5 4 
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. .. . iviacmne 

Time of Heat Treatment - 

Day Process 12 3 4 

6 3 8 7 3 2 7 9 

1 1 -1 4 8 1 0 11 6 


11 a.m. . 3 1642094 

Z 1-213-1163 


5 4 10 11 -1 2 10 5 

9 6 6 4 6 1 4 8 


3 p.m. . 60870 -2 43 

z 3 7 10 04 -4 70 


(a) Analyze the data from this experiment, assuming that the four observations in each 
cell are replicates. 

(b) Analyze the residuals from this experiment. Is there any indication that there is an 
outlier in one cell? If you find an outlier, remove it and repeat the analysis from part 
(a). What are your conclusions? 

(c) Suppose that the observations in the cells are the lengths (coded) of bars processed 
together in heat treating and then cut sequentially (that is, in order) on the four 
machines. Analyze the data to determine the effects of the three factors on mean 
length. 

(d) Calculate the log variance of the observations in each cell. Analyze this response. 
What conclusions can you draw? 

(e) Suppose the time at which a bar is cut really cannot be controlled during routine 
production. Analyze the average length and the log variance of the length for each 
of the 12 bars cut at each machine/heat treatment process combination. What con¬ 
clusions can you draw? 
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10-1 INTRODUCTION 

In many problems there are two or more variables that are related, and it is of interest 
to model and explore this relationship. For example, in a chemical process the yield of 
product is related to the operating temperature. The chemical engineer may want to build 
a model relating yield to temperature and then use the model for prediction, process 
optimization, or process control. 

In general, suppose that there is a single dependent variable or response y that 
depends on k independent or regressor variables, for example, x u x 2 ,..., x k . The 
relationship between these variables is characterized by a mathematical model called a 
regression model. The regression model is fit to a set of sample data. In some instances, 
the experimenter knows the exact form of the true functional relationship between y and 
jci, x 2 , ■ ■ ■ , x k , say y = <f>(x u x 2 ,..., x k ). However, in most cases, the true functional 
relationship is unknown, and the experimenter chooses an appropriate function to ap¬ 
proximate (f>. Low-order polynomial models are widely used as approximating functions. 

There is a strong interplay between design of experiments and regression analysis. 
Throughout this book we have emphasized the importance of expressing the results of 
an experiment quantitatively, in terms of an empirical model, to facilitate understanding, 
interpretation, and implementation. Regression models are the basis for this. On numer¬ 
ous occasions we have shown the regression model that represented the results of an 
experiment. In this chapter, we present some aspects of fitting these models. More com¬ 
plete presentations of regression are available in Montgomery and Peck (1992) and Myers 
(1990). 

Regression methods are frequently used to analyze data from unplanned experi¬ 
ments, such as might arise from observation of uncontrolled phenomena or historical 
records. Regression methods are also very useful in designed experiments where some¬ 
thing has “gone wrong.” We will illustrate some of these situations in this chapter. 
392 
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10-2 LINEAR REGRESSION MODELS 

We will focus on fitting linear regression models. To illustrate, suppose that we wish to 
develop an empirical model relating the viscosity of a polymer to the temperature and 
the catalyst feed rate. A model that might describe this relationship is 

y = i So + jSpti + + € (lo-i) 

where y represents the viscosity, x, represents the temperature, and x 2 represents the 
catalyst feed rate. This is a multiple linear regression model with two independent 
variables. We often call the independent variables predictor variables or regressors. 
The term linear is used because Equation 10-1 is a linear function of the unknown 
parameters p 0 , p u and /3 2 . The model describes a plane in the two-dimensional x { , x 2 
space. The parameter /3 0 defines the intercept of the plane. We sometimes call Pi and jS 2 
partial regression coefficients, because j8j measures the expected change in y per unit 
change in x x when x 2 is held constant, and p 2 measures the expected change in y per unit 
change in x 2 when jq is held constant. 

In general, the response variable y may be related to k regressor variables. The model 
y = Po + PiXi + P 2 X 2 + ■ • • + p k x k + e (10-2) 

is called a multiple linear regression model with k regressor variables. The parameters 
Pj, j = 0, 1,..., k, are called the regression coefficients. This model describes a hyper¬ 
plane in the ^-dimensional space of the regressor variables {x,}. The parameter /3, rep¬ 
resents the expected change in response y per unit change in Xj when all the remaining 
independent variables x, (i ± j) are held constant. 

Models that are more complex in appearance than Equation 10-2 may often still be 
analyzed by multiple linear regression techniques. For example, consider adding an in¬ 
teraction term to the first-order model in two variables, say 

y = Po + P 1 X 1 + P 2 X 2 + P 12 X 1 X 2 + e (10-3) 

If we let x 3 = x { x 2 and p 3 = p l2 , then Equation 10-3 can be written as 

y = Po + P 1 X 1 + P 2 X 2 + P 3 X 3 + e (10-4) 

which is a standard multiple linear regression model with three regressors. Recall that 
we presented empirical models like Equations 10-2 and 10-4 in several examples in 
Chapters 6, 7, and 8 to quantitatively express the results of a two-level factorial design. 
As another example, consider the second-order response surface model in two variables: 

y = Po + P 1 X 1 + P 2 x 2 + PuX 2 i + p 22 x\ + P 12 X 1 X 2 + e (10-5) 
If we let x 3 = xx 4 = x\, x 5 = x^, p 3 = p u , p 4 = p 22 , and p 5 = p l2 , then this 
becomes 


y = Po + P1X1 + P 2 x 2 + P3X3 + (84X4 + P5X5 + e (10-6) 

which is a linear regression model. We have also seen this model in examples earlier in 
the text. In general, any regression model that is linear in the parameters (the p values) 
is a linear regression model, regardless of the shape of the response surface that it 
generates. 

In this chapter we will summarize methods for estimating the parameters in multiple 
linear regression models. This is often called model fitting. We will also discuss methods 



394 CHAPTER 10 FITTING REGRESSION MODELS 


for testing hypotheses and constructing confidence intervals for these models as well as 
for checking the adequacy of the model lit. Our focus is primarily on those aspects of 
regression analysis useful in designed experiments. For more complete presentations of 
regression, refer to Montgomery and Peck (1992) and Myers (1990). 


10-3 ESTIMATION OF THE PARAMETERS 
IN LINEAR REGRESSION MODELS 

The method of least squares is typically used to estimate the regression coefficients in a 
multiple linear regression model. Suppose that n > k observations on the response vari¬ 
able are available, say y u y 2 . y„. Along with each observed response y h we will have 

an observation on each regressor variable and let x tj denote the z'th observation or level 
of variable x r The data will appear as in Table 10-1. We assume that the error term e in 
the model has £(e) = 0 and V(e) = cr 2 and that the {e,} are uncorrelated random variables. 

We may write the model equation (Equation 10-2) in terms of the observations in 
Table 10-1 as 

yi = 00 + Pl x il + 02*<2 + • • • + p k x ik + €f (10-7) 

= Po + 2 PjXij + e, i = 1, 2. . n 

The method of least squares chooses the 0’s in Equation 10-7 so that the sum of the 
squares of the errors, e„ is minimized. The least squares function is 

T * 2 (10 ' 8) 

= 2 (y. - Po - 2 PjXi^J 

The function L is to be minimized with respect to /3 0 > 0i> • • • > 0*- The least squares 
estimators, say 0 O , 0i,..., 0*, must satisfy 

- 2 PjXij) = 0 (10-9a) 


= 0 j = 1, 2,..., k (10-9b) 


and 
dL I 


£L,-■!(»-* 

■-2 2 (»-A.-2Ax„)^ 


Table 10-1 Data for Multiple Linear Regression 

y _ xi _*2__ x k _ 

yi In X12 ■■■ x u 

y-l x 2l X 22 • ■ • X 2k 
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Simplifying Equation 10-9, we obtain 

«ft + Pi E *n + ft E *a + ■ ■ ■ + ft E *» = E y, 

ft Y */i + 0i Y xfi + ft E x n x a + ■■■ + P k Y x n x ik = Y *,i y, 

/=i i <-i m '■=> (io-io) 

Po Y Xik + A E ^ipt/1 + ft E Xi k X (2 + ■ ■ ■ + PkYxfk = E T/aJ, 

These equations are called the least squares normal equations. Note that there are p = 
k + 1 normal equations, one for each of the unknown regression coefficients. The solution 
to the normal equations will be the least squares estimators of the regression coefficients 

/So, A.ft. 

It is simpler to solve the normal equations if they are expressed in matrix notation. 
We now give a matrix development of the normal equations that parallels the develop¬ 
ment of Equation 10-10. The model in terms of the observations, Equation 10-7, may be 
written in matrix notation as 


y = x/j + € 


where 



yi 


’l 

Xu x n • 

‘ *lXr 


yi 


1 

*21 *22 ‘ 

‘ *2* 

y = 

y»_ 

, x = 

1 

*nl *«2 * 

• x nk _ 


’ft' 


e, 




Pi 





P = 

.ft. 

and € = 





In general, y is an (n X 1) vector of the observations, X is an (n X p) matrix of the levels 
of the independent variables, /J is a (p X 1) vector of the regression coefficients, and e 
is an (n X 1) vector of random errors. 

We wish to find the vector of least squares estimators, 0, that minimizes 

L = E 3 = e'€ = (y - Xft'(y - x/3) 


Note that L may be expressed as 

L = y'y ~ P'X'y - y'Xp + p'X'Xp (10-11) 

= y'y - 2ftX'y + jS'X'XjS 

because ftX'y is a (1 X 1) matrix, or a scalar, and its transpose (ftX'y)' - y'X/J is the 
same scalar. The least squares estimators must satisfy 
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^ = — 2X'y + 2X'X/3 = 0 

dp |jg 

which simplifies to 

X'X/3 = X'y (10-12) 

Equation 10-12 is the matrix form of the least squares normal equations. It is identical 
to Equation 10-10. To solve the normal equations, multiply both sides of Equation 10-12 
by the inverse of X'X. Thus, the least squares estimator of j8 is 

P = (X'X) _1 X'y (10-13) 

It is easy to see that the matrix form of the normal equations is identical to the scalar 
form. Writing out Equation 10-12 in detail, we obtain 


n 2 */i x a ■■■ 2 x ik 

Po 


2 y t 

2) x n 2 x n S x n x i2 ■■■ E Xn x ik 

Pi 


2 x n y t 

'=, ,=, _ i-i . /=1 . 


_ 

1=1 : 

2 x >k 2 X ik X n 2 x ik x i2 2 xfk 

Pk 


2 xuy, 

_/=l -=1 i=l <=1 

. . 




If the indicated matrix multiplication is performed, the scalar form of the normal equa¬ 
tions (i.e., Equation 10-10) will result. In this form it is easy to see that X'X is a (p X p) 
symmetric matrix and X'y is a (p X 1) column vector. Note the special structure of the 
X'X matrix. The diagonal elements of X'X are the sums of squares of the elements in 
the columns of X, and the off-diagonal elements are the sums of cross-products of the 
elements in the columns of X. Furthermore, note that the elements of X'y are the sums 
of cross-products of the columns of X and the observations {y,}. 

The fitted regression model is 

y = x/j (10-14) 

In scalar notation, the fitted model is 

Si = Po + 2 PjXij i = 1 , 2 ,...,« 

i=i 

The difference between the actual observation y, and the corresponding fitted value y, is 
the residual, say e t = y, — y,. The (n X 1) vector of residuals is denoted by 

e = y - y (10-15) 

Estimating tr 2 

It is also usually necessary to estimate a 2 . To develop an estimator of this parameter, 
consider the sum of squares of the residuals, say 


SS E = 2 (yt - yf 

= %ef 

= e'e 
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Substituting e=y-y=y- X0, we have 

SS E = (y - X0)’( y - X0) 

= y'y - P'x'y - y'x/s + 0xx0 

= y'y - 2/8'X'y + jS'X'X/3 

Because X'X0 = X'y, this last equation becomes 

SS E = y'y - fi'X'y (10-16) 

Equation 10-16 is called the error or residual sum of squares, and it has n — p degrees 
of freedom associated with it. It can be shown that 

E(SS e ) = a 2 (n - p ) 

so an unbiased estimator of cr 2 is given by 

& 2 = (10-17) 

n- p 


Properties of the Estimators 

The method of least squares produces an unbiased estimator of the parameter j3 in the 
linear regression model. This may be easily demonstrated by taking the expected value 
of 0 as follows: 


E(fr = E[(X'X)-‘X'y] 

= EKX'Xr'X'fXj8 + e)] 

= E[(X'X) _I X'X^ + (X'X) _I X'e] 

= P 

because £(e) = 0 and (X'X) - 'X'X = I. Thus, 0 is an unbiased estimator of j8. 

The variance property of 0 is expressed in the covariance matrix: 

Cov(0) = E{[0 - E(0)][0 - E(0)Y} (10-18) 

which is just a symmetric matrix whose ith main diagonal element is the variance of the 
individual regression coefficient 0 t and whose (j/)th element is the covariance between 
0 and 0'j. The covariance matrix of 0 is 

Cov(0) = a 2 (X'X) 1 (10-19) 


EXAMPLE 10-1 . 

Sixteen observations on the viscosity of a polymer (y) and two process variables— 
reaction temperature (x,) and catalyst feed rate (x 2 )—are shown in Table 10-2 on the 
next page. We will fit a multiple linear regression model 


y = Po + PiXi + P 2 x 2 + e 
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Table 10-2 

Viscosity Data for Example 
@ 100°C) 

10-1 (viscosity ii 

n centistokes 

Observation 

Temperature 
(*i. °Q 

Catalyst 

Feed Rate 
to, lb/h) 

Viscosity 

1 

80 

8 

2256 

2 

93 

9 

2340 

3 

100 

10 

2426 

4 

82 

12 

2293 

5 

90 

11 

2330 

6 

99 

8 

2368 

7 

81 

8 

2250 

8 

96 

10 

2409 

9 

94 

12 

2364 

10 

93 

11 

2379 

11 

97 

13 

2440 

12 

95 

11 

2364 

13 

100 

8 

2404 

14 

85 

12 

2317 

15 

86 

9 

2309 

16 

87 

12 

2328 


to these data. The X matrix and y vector are 


1 80 

8 


2256 

1 93 

9 


2340 

1 100 

10 


2426 

1 82 

12 


2293 

1 90 

11 


2330 

1 99 

8 


2368 

1 81 

8 


2250 

1 96 

10 


2409 

1 94 

12 

y = 

2364 

1 93 

11 


2379 

1 97 

13 


2440 

1 95 

11 


2364 

1 100 

8 


2404 

1 85 

12 


2317 

1 86 

9 


2309 

1 87 

12 


2328 


The X'X matrix is 


r. . ... n 

80 93 • • • 87 


1 

1 


80 

93 


8 

9 


8 9 


12 . 


1 87 12 


X'X = 
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and the X'y vector is 


16 

1458 

164 

1458 

133,560 

14,946 

164 

14,946 

1,726 




1 

87 

12 


2256 

2340 

2328 


37,577 

3,429,550 

385,562 


The least squares estimate of /3 is 


£ = (X'X)-'X'y 


or 


" 14.176004 

-0.129746 

-0.223453 

37,577" 

-0.129746 

1.429184 X 10“ 3 

-4.763947 X 10" 5 

3,429,550 

_-0.223453 

-4.763947 X 10“ 5 

2.222381 X 10~ 2 

385,562 

"1566.07777" 

7.62129 

8.58485 





Table 10-3 Predicted Values, Residuals, and Other Diagnostics from Example 10-1 
Observation Predicted Residual Studentized 

i y, Value y t e, h ti Residual D, ^-Student 


1 2256 

2 2340 

3 2426 

4 2293 

5 2330 

6 2368 

7 2250 

8 2409 

9 2364 

10 2379 

11 2440 

12 2364 

13 2404 

14 2317 

15 2309 

16 2328 


2244.5 11.5 

2352.1 -12.1 

2414.1 11.9 

2294.0 -1.0 

2346.4 -16.4 

2389.3 -21.3 

2252.1 -2.1 

2383.6 25.4 

2385.5 -21.5 

2369.3 9.7 

2416.9 23.1 

2384.5 -20.5 

2396.9 7.1 

2316.9 0.1 

2298.8 10.2 

2332.1 -4.1 


0.350 0.87 

0.102 -0.78 

0.177 0.80 

0.251 -0.07 

0.077 -1.05 

0.265 -1.52 

0.319 -0.15 

0.098 1.64 

0.142 -1.42 

0.080 0.62 

0.278 1.66 

0.096 -1.32 

0.289 0.52 

0.185 0.01 

0.134 0.67 

0.156 -0.28 


0.137 0.87 

0.023 -0.77 

0.046 0.79 

0.001 -0.07 

0.030 -1.05 

0.277 -1.61 

0.004 -0.15 

0.097 1.76 

0.111 -1.48 

0.011 0.60 

0.354 1.80 

0.062 -1.36 

0.036 0.50 

0.000 <0.01 

0.023 0.66 

0.005 - 0.27 
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Figure 10-1 Normal probability plot of residuals. Example 
10-1. 


The least squares fit, with the regression coefficients reported to two decimal places, is 
y = 1566.08 + 7.62*! + 8.58* 2 

The first three columns of Table 10-3 (on the previous page) present the actual 
observations y h the predicted or fitted values y h and the residuals. Figure 10-1 is a normal 
probability plot of the residuals. Plots of the residuals versus the predicted values y t and 
versus the two variables jq and* 2 are shown in Figures 10-2,10-3, and 10-4, respectively. 
Just as in designed experiments, residual plotting is an integral part of regression model 
building. These plots indicate that there is a tendency for the variance of the observed 
viscosity to increase with the magnitude of viscosity. Figure 10-3 suggests that the vari¬ 
ability in viscosity is increasing as temperature increases. 



Figure 10-2 Plot of residuals versus predicted viscosity, Ex¬ 
ample 10-1. 






Using the Computer 

Regression model fitting is almost always done using a statistical software package. Table 
10-4 (page 402) shows some of the output obtained when Minitab is used to fit the 
viscosity regression model in Example 10-1. Many of the quantities in this output should 
be familiar because they have similar meanings to the quantities in the output displays 
for computer analysis of data from designed experiments. We have seen many such 
computer outputs previously in the book. In subsequent sections, we will discuss the 
analysis of variance and t test information in Table 10-4 in detail and will show exactly 
how these quantities were computed. 



8.00 8.83 9.67 10.50 11.33 12.17 13.00 


x 2 , catalyst feed rate 

Figure 10-4 Plot of residuals versus x 2 (feed rate). Example 
10-1. 
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Table 10-4 Minitab Output for the Viscosity Regression Model, Example 10-1 
Regression Analysis 

The regression equation is 

Viscosity = 1566 + 7.62 Temp + 8.58 Feed Rate 

Predictor Coef StDev T P 

Constant 1566.08 61.59 25.43 0.000 

Temp 7.6213 0.6184 12.32 0.000 

Feed Rat 8.585 2.439 3.52 0.004 

S = 16.36 R-Sq = 92.7% R-Sq(adj) = 91.6% 

Analysis of Variance 

Source DF SS MS F P 

Regression 2 44157 22079 82.50 0.000 

Residual Error 13 3479 268 

Total 15 47636 

Source DF SeqSS 

Temp 1 40841 

Feed Rat 1 3316 
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Fitting Regression Models in Designed Experiments 

We have often used a regression model to present the results of a designed experiment 
in a quantitative form. We now give a complete example showing how this is done. This 
is followed by three other brief examples that illustrate other useful applications of re¬ 
gression analysis in designed experiments. 


EXAMPLE 10-2 . 

Regression Analysis of a 2 3 Factorial Design 

A chemical engineer is investigating the yield of a process. Three process variables are 
of interest: temperature, pressure, and catalyst concentration. Each variable can be run 
at a low and a high level, and the engineer decides to run a 2 3 design with four center 
points. The design and the resulting yields are shown in Figure 10-5, where we have 
shown both the natural levels of the design factor and the +1, — 1 coded variable notation 
normally employed in 2* factorial designs to represent the factor levels. 

Suppose that the engineer decides to fit a main effects only model, say 


y = p 0 + 0!*! + p 2 x 2 + /3 3 *3 + e 


For this model the X matrix and y vector are 


"l -1 

-1 

-1 ■ 


32 

1 1 

-1 

-1 


46 

1 -1 

1 

-1 


57 

1 1 

1 

-1 


65 

1 -1 

-1 

1 


36 

1 1 

-1 

1 


48 

1 -1 

1 

1 

y = 

57 

1 1 

1 

1 


68 

1 0 

0 

0 


50 

1 0 

0 

0 


44 

1 0 

0 

0 


53 

1 0 

0 

0 


56 


It is easy to show that 


"12 

0 

0 

O' 


'612' 

0 

8 

0 

0 

x'y = 

45 

0 

0 

8 

0 

85 

_ 0 

0 

0 

8 


9 


Because X'X is diagonal, the required inverse is also diagonal, and the least squares 
estimates of the regression coefficients is 


'1/12 

0 

0 

0" 

"612' 


'51.000" 

0 

1/8 

0 

0 

45 


5.625 

0 

0 

1/8 

0 

85 


10.625 

0 

0 

0 

1/8 

9 


1.125 


The fitted regression model is 

y = 51.000 + 5.625x, + 10.625 jc 2 + 1.125a 3 
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As we have made use of on many occasions, the regression coefficients are closely 
connected to the effect estimates that would be obtained from the usual analysis of a 2 3 
design. For example, the effect of temperature is (refer to Figure 10-5) 

T = y T + — y T - 
= 56.75 - 45.50 
= 11.25 

Notice that the regression coefficient for x, is 

(11.25)/2 = 5.625 

That is, the regression coefficient is exactly one-half the usual effect estimate. This will 
always be true for a 2* design. As noted above, we used this result in Chapters 6 through 
8 to produce regression models, fitted values, and residuals for several two-level exper¬ 
iments. This example demonstrates that the effect estimates from a 2 k design are least 
squares estimates. 


In Example 10-2, the inverse matrix is easy to obtain because X'X is diagonal. 
Intuitively, this seems to be advantageous, not only because of the computational sim¬ 
plicity but also because the estimators of all the regression coefficients are uncorrelated; 
that is, Cov(jS„ &) = 0. If we can choose the levels of the x variables before the data 
are collected, we might wish to design the experiment so that a diagonal X'X will result. 

In practice, it can be relatively easy to do this. We know that the off-diagonal ele¬ 
ments in X'X are the sums of cross-products of the columns in X. Therefore, we must 
make the inner product of the columns of X equal to zero; that is, these columns must 
be orthogonal. Experimental designs that have this property for fitting a regression model 
are called orthogonal designs. In general, the 2 k factorial design is an orthogonal design 
for fitting the multiple linear regression model. 

Regression methods are extremely useful when something “goes wrong” in a de¬ 
signed experiment. This is illustrated in the next two examples. 


EXAMPLE 10-3 . 

A 2 3 Factorial Design with a Missing Observation 

Consider the 2 3 factorial design with four center points from Example 10-2. Suppose 
that when this experiment was performed, the run with all variables at the high level (run 
8 in Figure 10-5) was missing. This can happen for a variety of reasons; the measurement 
system can produce a faulty reading, the combination of factor levels may prove infea¬ 
sible, the experimental unit may be damaged, and so forth. 

We will fit the main effects model 

y = {} 0 + /3|X| + ji 2 x 2 + 03*3 + e 

using the 11 remaining observations. The X matrix and y vector are 
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"l -1 

-1 

-1 " 


32" 

1 1 

-1 

-1 


46 

1 -1 

1 

-1 


57 

1 1 

1 

-1 


65 

1 -1 

-1 

1 


36 

1 1 

-1 

1 

y = 

48 

1 -1 

1 

1 


57 

1 0 

0 

0 


50 

1 0 

0 

0 


44 

1 0 

0 

0 


53 

1 0 

0 

0 


56 


To estimate the model parameters, we form 


' 11 -1 -1 -1" 


' 544" 

-1 7 -1 -1 

X'y =. 

-23 

-1 -1 7 -1 

17 

_-l -1 -1 7_ 


-59 


and then 

p = (X'X) _1 X'y 


"9.61538 X 

10“ 2 1.92307 X 10“ 2 

1.92307 X 10“ 2 

1.92307 X 10- 2 " 

" 544" 

1.92307 X 

10" 2 0.15385 

2.88462 X 10“ 2 

2.88462 X 10~ 2 

-23 

1.92307 X 

10“ 2 2.88462 X 10“ 2 

0.15385 

2.88462 X 10~ 2 

17 

_ 1.92307 X 

10“ 2 2.88462 X 10~ 2 

2.88462 X 10~ 2 

0.15385 

_-59_ 


"51.25" 

5.75 
10.75 
_ 1.25_ 

Therefore, the fitted model is 

$ = 51.25 + 5.75*, + 10.75*2 +1.25x3 

Compare this model to the one obtained in Example 10-2, where all 12 observations 
were used. The regression coefficients are very similar. Because the regression coeffi¬ 
cients are closely related to the factor effects, our conclusions would not be seriously 
affected by the missing observation. However, notice that the effect estimates are no 
longer orthogonal because (X'X) and its inverse are no longer diagonal. 


EXAMPLE 10-4 . 

Inaccurate Levels in Design Factors 

When running a designed experiment, it is some tim es difficult to reach and hold the 
precise factor levels required by the design. Small discrepancies are not important, but 
large ones are potentially of more concern. Regression methods are useful in the analysis 
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Table 10-5 Experimental Design for Example 10-4 


Run 

Process Variables 


Coded Variables 

Yield 

y 

Temp (°C) 

Pressure 

(psig) 

(g/D 


x 2 

*3 

1 

125 

41 

14 

-0.75 

-0.95 

-1.133 

32 

2 

158 

40 

15 

0.90 

-1 

-1 

46 

3 

121 

82 

15 

-0.95 

1.1 

-1 

57 

4 

160 

80 

15 

1 

1 

-1 

65 

5 

118 

39 

33 

-1.10 

-1.05 

1.14 

36 

6 

163 

40 

30 

1.15 

-1 

1 

48 

7 

122 

80 

30 

-0.90 

1 

1 

57 

8 

165 

83 

30 

1.25 

1.15 

1 

68 

9 

140 

60 

22.5 

0 

0 

0 

50 

10 

140 

60 

22.5 

0 

0 

0 

44 

11 

140 

60 

22.5 

0 

0 

0 

53 

12 

140 

60 

22.5 

0 

0 

0 

56 


of a designed experiment where the experimenter has been unable to obtain the required 
factor levels. 

To illustrate, the experiment in Table 10-5 shows a variation of the 2 3 design from 
Example 10-2, where many of the test combinations are not exactly the ones specified 
in the design. Most of the difficulty seems to have occurred with the temperature variable. 

We will fit the main effects model 

y = j3 0 + 0i*i + j3 2 *2 + 03*3 + e 

The X matrix and y vector are 


' 1 -0.75 

-0.95 

-1.133' 


'32 ' 

1 0.90 

-1 

-1 


46 

1 -0.95 

1.1 

-1 


57 

1 1 

1 

-1 


65 

1 -1.10 

-1.05 

1.4 


36 

1 1.15 

-1 

1 


48 

1 -0.90 

1 

1 

y = 

57 

1 1.25 

1.15 

1 


68 

1 0 

0 

0 


50 

1 0 

0 

0 


44 

1 0 

0 

0 


53 

1 0 

0 

0 


56 


To estimate the model parameters, we need 


"12 

0.60 

0.25 

0.2670" 


"612 

0.60 

0.25 

8.18 

0.31 

0.31 

8.5375 

-0.1403 

-0.3437 

X'y = 

77.55 

161.50 

_ 0.2670 

-0.1403 

-0.3437 

9.2437 


19.144 


XX = 
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Then 

P = (X'X) _1 X'y 


8.37447 X 1(T 2 

-6.09871 X 10~ 3 

-2.33542 X 10“ 3 

-2.59833 X 10- 3- 

’612 

-6.09871 X 10 3 

0.12289 

-4.20766 X 10~ 3 

1.88490 X 10 3 

77.55 

-2.33542 X 10 3 

-4.20766 X 10“ 3 

0.11753 

4.37851 X 10 3 

161.50 

-2.59833 X 10" 3 

1.88490 X 10~ 3 

4.37851 X 10 3 

0.10845 

19.144 


"50.36496" 

5.41932 
10.16672 
_ 1.07653_ 

The fitted regression model, with the coefficients reported to two decimal places, is 
y = 50.36 + 2xi + 10.17x 2 + 1.08x 3 

Comparing this to the original model in Example 10-2, where the factor levels were 
exactly those specified by the design, we note very little difference. The practical inter¬ 
pretation of the results of this experiment would not be seriously affected by the inability 
of the experimenter to achieve the desired factor levels exactly. 


EXAMPLE 10-5 . 

De-aliasing Interactions in a Fractional Factorial 

We observed in Chapter 8 that it is possible to de-alias interactions in a fractional factorial 
design by a process called fold over. For a resolution III design, a full fold over is 
constructed by running a second fraction in which the signs are reversed from those in 
the original fraction. Then the combined design can be used to de-alias all main effects 
from the two-factor interactions. 

A difficulty with fold over is that it requires a second group of runs of identical size 
as the original design. It is usually possible to de-alias certain interactions of interest by 
augmenting the original design with fewer runs than required in a full fold over. Re¬ 
gression methods are an easy way to formulate this problem and see how to solve it. 

To illustrate, suppose that we have run a 2?v 1 design. Table 8-3 shows the principal 
fraction of this design, in which I = ABCD. Suppose that after the data from the first 
eight trials were observed, the largest effects were A, B, C, D (we ignore the three-factor 
interactions that are aliased with these main effects), and the AB + CD alias chain. The 
other two alias chains can be ignored, but clearly either AB, or CD, or both two-factor 
interactions are large. To find out which interactions are important we could, of course, 
mn the alternate fraction, which would require another eight trials. Then all 16 runs could 
be used to estimate the main effects and the two-factor interactions. 

It is possible to de-alias AB and CD in fewer than eight additional trials. Suppose 
that we wish to fit the model 


y ~ 00 + 01*1 + 0 2 *2 + 03*3 + 04*4 + 012*1*2 + 034*3*4 + 6 
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where x u x 2 , x 3 , and x 4 are the coded variables representing A, B, C, and D. Using the 
design in Table 8-3, the X matrix for this model is 

JCj X 2 x 3 X 4 X x X 2 X3X4 

\ -1 -1 -1 -1 1 f 

1 1-1-1 1-1 -1 

1-1 1-1 1-1 -1 

1 1 1-1-1 1 1 
A 1-1-11111 
1 1-1 1-1-1 -1 

1-1 1 1-1-1 -1 

11111 1 1, 

where we have written the variables above the columns to facilitate understanding. Notice 
that the x,x 2 column is identical to the x 3 x 4 column (as anticipated, because AB or x,x 2 
is aliased with CD or x 3 x 4 ), implying a linear dependency in the columns of X. Therefore 
we cannot estimate both /S 12 and fi 34 in the model. However, suppose that we add a single 
run X[ = - 1, x 2 = - 1 , x 3 = -1, and x 4 = 1 from the alternate fraction to the original 
eight runs. The X matrix for the model now becomes 

X\ X2 X3 X4 XjX2 X3X4 

1 -1 -1 -1 -1 1 r 

1 1-1-1 1-1 -1 

1-1 1-1 1-1 -1 

111-1-1 1 1 

x = 1-1—1 1 1 1 1 

1 1-1 1 —1—1 — 1 

1-1 1 1-1-1 -1 

11111 1 1 

.1 -1 -1-11 1 -1. 

Notice that the columns x,x 2 and x 3 x 4 are now no longer identical, and we can fit the 
model including both the x r x 2 (AB) and x 3 x 4 (CD) interactions. The magnitudes of the 
regression coefficients will give insight regarding which interactions are important. 

Although adding a single run will de-alias the AB and CD interactions, this approach 
does have a disadvantage. Suppose that there is a time effect (or a block effect) between 
the first eight runs and the last run added above. Add a column to the X matrix for blocks, 
and you obtain the following: 

X\ x 2 x 3 x 4 XiX 2 x 3 x 4 blocks 
'1 -1 -1 -1-1 1 1 -1] 

1 1 - 1-1 1-1 -1 -1 | 

1-1 1-1 1-1 -1 -1 

111-1-1 1 1 -1 

X= 1 -1 -1 1 1 1 1 -1 

1 1-1 1-1-1 -1 -1 

1-1 1 1-1-1 -1 -1 

1111111 -1 
.1 -1 -1 -1 1 


1 -1 
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We have assumed the block factor was at the low or level during the first eight 
runs, and at the high or “ + ” level during the ninth run. It is easy to see that the sum of 
the cross-products of every column with the block column does not sum to zero, meaning 
that blocks are no longer orthogonal to treatments, or that the block effect now affects 
the estimates of the model regression coefficients. To block orthogonally, you must add 
an even number of runs. For example, the four runs 


will de-alias AB from CD and allow orthogonal blocking (you can see this by writing 
out the X matrix as we did previously). 

In general, it is usually straightforward to examine the X matrix for the reduced 
model obtained from a fractional factorial and determine which runs to augment the 
original design with to de-alias interactions of potential interest. Furthermore, the impact 
of specific augmentation strategies can be evaluated using the general results for regres¬ 
sion models given later in this chapter. There are also computer-based methods for con¬ 
structing designs that can be useful for design augmentation to de-alias effects. We will 
discuss these computer-generated designs in the next chapter. 


10-4 HYPOTHESIS TESTING IN MULTIPLE REGRESSION 

In multiple linear regression problems, certain tests of hypotheses about the model pa¬ 
rameters are helpful in measuring the usefulness of the model. In this section, we describe 
several important hypothesis-testing procedures. These procedures require that the errors 
e, in the model be normally and independently distributed with mean zero and variance 
a 2 , abbreviated e ~ NID(0, a 2 ). As a result of this assumption, the observations y, are 
normally and independently distributed with mean j8 0 + and variance cr 2 . 


10-4.1 Test for Significance of Regression 

The test for significance of regression is a test to determine if there is a linear relationship 
between the response variable y and a subset of the regressor variables x u x 2 ,..., x k . 
The appropriate hypotheses are 

= P 2 = • • • = P k = 0 ( 10 - 20 ) 

Hi : /?, =£ 0 for at least one j 

Rejection of H 0 in Equation 10-20 implies that at least one of the regressor variables x ,, 
x 2 ,.. ., x k contributes significantly to the model. The test procedure involves an analysis 
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of variance partitioning of the total sum of squares SS T into a sum of squares due to the 
model (or to regression) and a sum of squares due to residual (or error), say 

SS T = SS R + SS E (10-21) 

Now if the null hypothesis H 0 : /3 X = p 2 = • • • = P k = 0 is true, then SS R /ar 2 is distributed 
as xl, where the number of degrees of freedom for are equal to the number of regressor 
variables in the model. Also, we can show that SS E /<r 2 is distributed as Xn-k-i ar| d that 
SS E and SS R are independent. The test procedure for H 0 'P\ = /3 2 = • ■ • = = 0 is to 

compute 


SS R /k _ MS r 
SS E /( n - k - 1) MS e 


( 10 - 22 ) 


and to reject H 0 if F 0 exceeds Alternatively, we could use the P-value approach 

to hypothesis testing and, thus, reject H 0 if the P-value for the statistic F 0 is less than a. 
The test is usually summarized in an analysis of variance table such as Table 10-6. 

A computational formula for SS R may be found easily. We have derived a compu¬ 
tational formula for SS E in Equation 10-16—that is, 


SS E = y'y - p'X'y 

Now, because SS T = 2” =1 yf - (2? =1 y,) 2 /n = y'y - (2?., y,) 2 /n, we may rewrite the 
foregoing equation as 



or 


SS E = SS T - SS R 

Therefore, the regression sum of squares is 



and the error sum of squares is 

SS E = y'y - p'X'y 


(10-23) 


(10-24) 


Table 10-6 Analysis of Variance for Significance of Regression in Multiple Regression 


Source of 
Variation 

Sum of Squares 

Degrees of 
Freedom 

Mean Square 

F 0 

Regression 

SS R 

k 

MS r 

ms r /ms e 

Error or residual 

SS E 

n-k - 1 

MS e 


Total 

SS T 

n-\ 
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and the total sum of squares is 


SS T = y'y - 



(10-25) 


These computations are almost always performed with regression software. For in¬ 
stance, Table 10-4 shows some of the output from Minitab for the viscosity regression 
model in Example 10-1. The upper portion in this display is the analysis of variance for 
the model. The test of significance of regression in this example involves the hypotheses 

H 0 '-Pi = P 2 = 0 

Hi: Pj =£ 0 for at least one j 

The P-value in Table 10-4 for the F statistic (Equation 10-22) is very small, so we would 
conclude that at least one of the two variables—temperature (x t ) and feed rate (x 2 )—has 
a nonzero regression coefficient. 

Table 10-4 also reports the coefficient of multiple determination R 2 , where 


R 2 


SSr 

SS t 


SSe 

SS t 


(10-26) 


Just as in designed experiments, R 2 is a measure of the amount of reduction in the 
variability of y obtained by using the regressor variables x u x 2 ,, x k in the model. 
However, as we have noted previously, a large value of R 2 does not necessarily imply 
that the regression model is a good one. Adding a variable to the model will always 
increase R 2 , regardless of whether the additional variable is statistically significant or 
not. Thus it is possible for models that have large values of R 2 to yield poor predictions 
of new observations or estimates of the mean response. 

Because R 2 always increases as we add terms to the model, some regression model 
builders prefer to use an adjusted R 2 statistic defined as 


R 


SS E /(n - p) 
SS T /( n - 1) 



R 2 ) 


(10-27) 


In general, the adjusted R 2 statistic will not always increase as variables are added to the 
model. In fact, if unnecessary terms are added, the value of Rl dj will often decrease. 

For example, consider the viscosity regression model. The adjusted R 2 for the model 
is shown in Table 10-4. It is computed as 


Rl * = i 


= l 


(lf) (1 “ °- 92697) 


= 0.915735 


which is very close to the ordinary R 2 . When R 2 and R 2 d] differ dramatically, there is a 
good chance that nonsignificant terms have been included in the model. 
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10-4.2 Tests on Individual Regression Coefficients 
and Groups of Coefficients 


We are frequently interested in testing hypotheses on the individual regression coeffi¬ 
cients. Such tests would be useful in determining the value of each of the regressor 
variables in the regression model. For example, the model might be more effective with 
the inclusion of additional variables or perhaps with the deletion of one or more of the 
variables already in the model. 

Adding a variable to the regression model always causes the sum of squares for 
regression to increase and the error sum of squares to decrease. We must decide whether 
the increase in the regression sum of squares is sufficient to warrant using the additional 
variable in the model. Furthermore, adding an unimportant variable to the model can 
actually increase the mean square error, thereby decreasing the usefulness of the model. 

The hypotheses for testing the significance of any individual regression coefficient, 
say j8 >j, are 

Ho'-Pj = 0 

Hi'.pj A 0 

If Ho'.fij = 0 is not rejected, then this indicates that x y can be deleted from the model. 
The test statistic for this hypothesis is 


& 


(10-28) 


where C# is the diagonal element of (X'X) -1 corresponding to fy. The null hypothesis 
H 0 : Pj = 0 is rejected if | r 0 1 > t a/2 , n - k -i- Note that this is really a partial or marginal 
test, because the regression coefficient jS, depends on all the other regressor variables 

Xi (i A j) that are in the model. _ 

The denominator of Equation 10-28, V& 2 C jp is often called the standard error 
of the regression coefficient p r That is, 

se(pj) = Vd%j (10-29) 

Therefore, an equivalent way to write the test statistic in Equation (10-28) is 


A 

se(pj) 


(10-30) 


Most regression computer programs provide the t test for each model parameter. For 
example, consider Table 10-4, which contains the Minitab output for Example 10-1. The 
upper portion of this table gives the least squares estimate of each parameter, the standard 
error, the t statistic, and the corresponding P-value. We would conclude that both vari¬ 
ables, temperature and feed rate, contribute significantly to the model. 

We may also directly examine the contribution to the regression sum of squares for 
a particular variable, say x y , given that other variables x, (i ¥= j) are included in the model. 
The procedure for doing this is the general regression significance test or, as it is often 
called, the extra sum of squares method. This procedure can also be used to investigate 
the contribution of a subset of the regressor variables to the model. Consider the regres¬ 
sion model with k regressor variables: 


y = x/3 + € 
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where y is (n X 1), X is (n X p), p is (p X 1), e is (n X 1), and p = k + 1. We would 
like to determine if the subset of regressor variables x u x 2 ,..., x r (r < k) contribute 
significantly to the regression model. Let the vector of regression coefficients be parti¬ 
tioned as follows: 



where p x is (r X 1) and fi 2 is \(p - r) X 1], We wish to test the hypotheses 

H 0 : jS, = 0 

(10-31) 

The model may be written as 


y = Xp + e = X,/?! + X 2 p 2 + e (10-32) 

where X, represents the columns of X associated with p x and X 2 represents the columns 
of X associated with p 2 . 

For the full model (including both /?, and p 2 ), we know that p = (X'X) _1 X'y. Also, 
the regression sum of squares for all variables including the intercept is 

SS R (P) = P'X'y (p degrees of freedom) 
and 


M s E = iL=J*1 
n- P 

SS R (p) is called the regression sum of squares due to p. To find the contribution of the 
terms in p x to the regression, we fit the model assuming the null hypothesis H 0 :p { = 0 
to be true. The reduced model is found from Equation 10-32 with jSj = 0: 

y = X 2 p 2 + € (10-33) 


The least squares estimator of p 2 is p 2 = (X 2 X 2 ) 'X 2 y, and 

SS R (p 2 ) = p 2 X 2 y (p - r degrees of freedom) (10-34) 

The regression sum of squares due to /S, given that p 2 is already in the model is 

SS R (pi | p 2 ) = SS R (p) - SS R (p 2 ) (10-35) 

This sum of squares has r degrees of freedom. It is the “extra sum of squares” due to 
Pi ■ Note that SS R (P X \ p 2 ) is the increase in the regression sum of squares due to including 
the variables x u x 2 ,..., x r in the model. 

Now, SS R (Pi | p 2 ) is independent of MS E , and the null hypothesis /3, = 0 may be 
tested by the statistic 


_ SS R {Pi\p 2 )lr 
MS e 


(10-36) 


If F 0 > p , we reject H 0 , concluding that at least one of the parameters in /3, is not 
zero and, consequently, at least one of the variables x u x 2 ,..., x, in X! contributes 
significantly to the regression model. Some authors call the test in Equation 10-36 a 
partial F test. 
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The partial F test is very useful. We can use it to measure the contribution of Xj as 
if it were the last variable added to the model by computing 

SS R { ftlft, ft, • • •, fij-u P J+ i, ■ ■ •, ft) 

This is the increase in the regression sum of squares due to adding Xj to a model that 
already includes x u ... ,Xj-i,x j+u ..., x k . Note that the partial F test on a single variable 
Xj is equivalent to the t test in Equation 10-28. However, the partial F test is a more 
general procedure in that we can measure the effect of sets of variables. 

EXAMPLE 10-6 . 

Consider the viscosity data in Example 10-1. Suppose that we wish to investigate the 
contribution of the variable x 2 (feed rate) to the model. That is, the hypotheses we wish 
to test are 

ft:/ft = 0 
ft:ft * 0 

This will require the extra sum of squares due to ft, or 

SS R (p 2 1 ft, ft) = SS R ( ft, Pi, p 2 ) - SS R (P 0 , Pi) 

= SS R (Pi, ft I/So) - SS R (P 2 \p 0 ) 

Now from Table 10-4, where we tested for significance of regression, we have 
SS R (P U ft | ft) = 44,157.1 

which was called the model sum of squares in the table. This sum of squares has two 
degrees of freedom. 

The reduced model is 

y = ft + ft*! + e 

The least squares fit for this model is 

y = 1652.3955 + 7.6397jc, 

and the regression sum of squares for this model (with one degree of freedom) is 
Sft(ftlft) = 40,840.8 

Note that SS R (P, | ft) is shown at the bottom of the Minitab output in Table 10-4 under 
the heading “Seq SS.” Therefore, 

SS r (J3 2 | ft, ft) = 44,157.1 - 40,840.8 
= 3316.3 

with 2—1 = 1 degree of freedom. This is the increase in the regression sum of squares 
that results from adding jc 2 to a model already containing x u and it is shown at the bottom 
of the Minitab output on Table 10-4. To test ft: ft = 0, from the test statistic we obtain 

„ Sft(ft|ft,ft)/1 3316.3/1 

Fo “ MS, 26T604 " 123926 

Note that MS, from the full model (Table 10-4) is used in the denominator of F 0 . Now, 
because f 0 . 05 , 1,13 = 1.67, we would reject ft:ft = 0 and conclude that x 2 (feed rate) 
contributes significantly to the model. 
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Because this partial F test involves only a single regressor, it is equivalent to the t 
test because the square of a t random variable with v degrees of freedom is an F random 
variable with 1 and v degrees of freedom. To see this, note from Table 10-4 that the t 
statistic for H 0 : ft = 0 resulted in t 0 = 3.5203 and that tl = (3.5203) 2 = 12.3925 — F 0 . 


10-5 CONFIDENCE INTERVALS IN MULTIPLE REGRESSION 

It is often necessary to construct confidence interval estimates for the regression coeffi¬ 
cients {ft} and for other quantities of interest from the regression model. The develop¬ 
ment of a procedure for obtaining these confidence intervals requires that we assume the 
errors {e, } to be normally and independently distributed with mean zero and variance 
cr 2 , the same assumption made in the section on hypothesis testing in Section 10-4. 


10-5.1 Confidence Intervals on the Individual Regression Coefficients 

Because the least squares estimator /3 is a linear combination of the observations, it 
follows that /3 is normally distributed with mean vector (i and covariance matrix 
<r 2 (X'X) -1 . Then each of the statistics 

<‘°- 37) 

is distributed as t with n — p degrees of freedom, where Cjj is the (jj )th element of the 
(X'X) -1 matrix, and & 2 is the estimate of the error variance, obtained from Equation 10- 
17. Therefore, a 100(1 — a) percent confidence interval for the regression coefficient ft, 
j = 0 , 1 , ...,&, is 

ft - t a/ 2 ,n-pV& 2 Cjj ss ft « ft + f a /2,„- P Vd- 2 Q (10-38) 

Note that this confidence interval could also be written as 

ft - t a/ 2, n -pSe((3j) $ ft s£ ft + t a /2, n -pse(frj) 
because se(ft) = V d- 2 Cjj. 

EXAMPLE 10-7 . 

We will construct a 95 percent confidence interval for the parameter ft in Example 10- 
1. Now ft = 7.62129, and because & 2 = 267.604 and C n = 1.429184 X 10 -3 , we find 
that 

ft — fo.025.13V & 2 C n ^ ft ^ Pi + fo.025.13V & 2 C\ 1 

7.62129 - 2.16V(267.604)(1.429184 X 10“ 3 ) ^ ft 

^ 7.62129 + 2.16V(267.604)(1.429184 X 10 3 ) 
7.62129 - 2.16(0.6184) ^ ft ^ 7.62129 + 2.16(0.6184) 
and the 95 percent confidence interval on ft is 

6.2855 ^ ft ^ 8.9570 
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10-5.2 Confidence Interval on the Mean Response 

We may also obtain a confidence interval on the mean response at a particular point, say, 
Toi, *02, • • •, x ok- We first define the vector 


1 


*01 

*02 


*0 * 


The mean response at this point is 

fly | X () = Po + j3i*oi + @2 X 02 + • • • + f$k x 0k = x 0 /3 
The estimated mean response at this point is 

$(Xo) = x () /3 (10-39) 

This estimator is unbiased, because £[y(Xo)] = £(x£/3) = x () /3 = ^ ]Xo , and the variance 
of y(x 0 ) is 

IWo)] = <* 2 Xo(X'X) -1 Xo (10-40) 

Therefore, a 100(1 - a) percent confidence interval on the mean response at the point 
*01, *02, .... *0* is 

Kxo) - U- P VWXf^ < ^ |x 0 ,_ 

^ y(Xo) + fa/ 2 ,«- P Vd- 2 x^(X'X) l x Q (10-41) 


10-6 PREDICTION OF NEW RESPONSE OBSERVATIONS 

A regression model can be used to predict future observations on the response y corre¬ 
sponding to particular values of the regressor variables, say * 0 i, *02, • • •, *0 *• If x o = 
[ 1 , *oi, *02, •••, *0*], then a point estimate for the future observation y 0 at the point 
*oi> *02, .... *<» is computed from Equation 10-39: 

y(Xo) = Xo/3 

A 100(1 - a) percent prediction interval for this future observation is 

y(x 0 ) - t a , 2 „-p\/& 2 (l + Xo(X'Xr‘xo) ^ y 0 

« y(Xo) + t al2 , n - P V&\l + Xo(X'X) _1 x 0 ) (10-42) 

In predicting new observations and in estimating the mean response at a given point 
* 01 , *02, • • •, *0*, we must be careful about extrapolating beyond the region containing 
the original observations. It is very possible that a model that fits well in the region of 
the original data will no longer fit well outside of that region. 


10-7 REGRESSION MODEL DIAGNOSTICS 

As we emphasized in designed experiments, model adequacy checking is an important 
part of the data analysis procedure. This is equally important in building regression 
models, and as we illustrated in Example 10-1, the residual plots that we used with 
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designed experiments should always be examined for a regression model. In general, it 
is always necessary to (1) examine the fitted model to ensure that it provides an adequate 
approximation to the true system and (2) verify that none of the least squares regression 
assumptions are violated. The regression model will probably give poor or misleading 
results unless it is an adequate fit. 

In addition to residual plots, there are other model diagnostics that are frequently 
useful in regression. This section briefly summarizes some of these procedures. For more 
complete presentations, see Montgomery and Peck (1992) and Myers (1990). 


10-7.1 Scaled Residuals and PRESS 
Standardized and Studentized Residuals 

Many model builders prefer to work with scaled residuals in contrast to the ordinary 
least squares residuals. These scaled residuals often convey more information than do 
the ordinary residuals. 

One type of scaled residual is the standardized residual: 

d ,, = 5 i= 1,2,..., n (10-43) 

tr 

where we generally use & = VMS £ in the computation. These standardized residuals 
have mean zero and approximately unit variance; consequently, they are useful in looking 
for outliers. Most of the standardized residuals should lie in the interval -3 d t =£ 3, 
and any observation with a standardized residual outside of this interval is potentially 
unusual with respect to its observed response. These outliers should be carefully exam¬ 
ined, because they may represent something as simple as a data-recording error or some¬ 
thing of more serious concern, such as a region of the regressor variable space where the 
fitted model is a poor approximation to the true response surface. 

The standardizing process in Equation (10-43) scales the residuals by dividing them 
by their approximate average standard deviation. In some data sets, residuals may have 
standard deviations that differ greatly. We now present a scaling that takes this into 
account. 

The vector of fitted values y, corresponding to the observed values y t is 

y = x/3 

= X(X'X) _1 X'y 

= Hy (10-44) 

The n X n matrix H = X(X'X) -1 X' is usually called the “hat” matrix because it maps 
the vector of observed values into a vector of fitted values. The hat matrix and its prop¬ 
erties play a central role in regression analysis. 

The residuals from the fitted model may be conveniently written in matrix nota¬ 
tion as 


e = y - y 

and it turns out that the covariance matrix of the residuals is 

Cov(e) = cr 2 (I - H) (10-45) 

The matrix I — H is generally not diagonal, so the residuals have different variances and 
they are correlated. 
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Thus, the variance of the ith residual is 

V(ed = o- 2 (l - h h ) (10-46) 

where h u is the ith diagonal element of H. Because 0 =£ h u *£ 1, using the residual mean 
square MS t to estimate the variance of the residuals actually overestimates V(e,). Fur¬ 
thermore, because h u is a measure of the location of the ith point in x- space, the variance 
of e, depends on where the point x,- lies. Generally, residuals near the center of the x 
space have larger variance than do residuals at more remote locations. Violations of 
model assumptions are more likely at remote points, and these violations may be hard 
to detect from inspection of e, (or dd because their residuals will usually be smaller. 

We recommend taking this inequality of variance into account when scaling the 
residuals. We suggest plotting the studentized residuals: 


Vd- 2 (1 — 


i = 1,2,..., n 


(10-47) 


with & 2 = MS e instead of e, (or dd- The studentized residuals have constant variance 
V(rd = 1 regardless of the location of x, when the form of the model is correct. In many 
situations the variance of the residuals stabilizes, particularly for large data sets. In these 
cases there may be little difference between the standardized and studentized residuals. 
Thus standardized and studentized residuals often convey equivalent information. How¬ 
ever, because any point with a large residual and a large /i„ is potentially highly influential 
on the least squares fit, examination of the studentized residuals is generally recom¬ 
mended. Table 10-3 displays the hat diagonals h u and the studentized residuals for the 
viscosity regression model in Example 10-1. 


PRESS Residuals 

The prediction error sum of squares (PRESS) provides a useful residual scaling. To 
calculate PRESS, we select an observation—for example, i. We fit the regression model 
to the remaining n — 1 observations and use this equation to predict the withheld ob¬ 
servation y,. Denoting this predicted value $,> we may find the prediction error for point 
i as e (0 = y, — y (l) . The prediction error is often called the ith PRESS residual. This 
procedure is repeated for each observation i = 1,2,... ,n, producing a set of n PRESS 
residuals e m , e (2) ,..., e (n) . Then the PRESS statistic is defined as the sum of squares of 
the n PRESS residuals as in 

PRESS = 2 4 = J) [y, - y (0 ] 2 (10-48) 

Thus PRESS uses each possible subset of n — 1 observations as an estimation data set, 
and every observation in turn is used to form a prediction data set. 

It would initially seem that calculating PRESS requires fitting n different regressions. 
However, it is possible to calculate PRESS from the results of a single least squares fit 
to all n observations. It turns out that the ith PRESS residual is 


_ e, 

?(<) = Y^hu 


(10-49) 


Thus because PRESS is just the sum of the squares of the PRESS residuals, a simple 
computing formula is 


PRESS = 



(10-50) 
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From Equation 10-49 it is easy to see that the PRESS residual is just the ordinary residual 
weighted according to the diagonal elements of the hat matrix h„. Data points for which 
h u are large will have large PRESS residuals. These observations will generally be high 
influence points. Generally, a large difference between the ordinary residual and the 
PRESS residuals will indicate a point where the model fits the data well, but a model 
built without that point predicts poorly. In the next section we will discuss some other 
measures of influence. 

Finally, we note that PRESS can be used to compute an approximate R 2 for predic¬ 
tion, say 


PRESS 

S„ 


(10-51) 


This statistic gives some indication of the predictive capability of the regression model. 
For the viscosity regression model from Example 10-1, we can compute the PRESS 
residuals using the ordinary residuals and the values of found in Table 10-3. The 
corresponding value of the PRESS statistic is PRESS = 5207.7. Then 


47,635.9 
= 0.8907 


Therefore we could expect this model to “explain” about 89 percent of the variability 
in predicting new observations, as compared to the approximately 93 percent of the 
variability in the original data explained by the least squares fit. The overall predictive 
capability of the model based on this criterion seems very satisfactory. 


R-Student 

The studentized residual r, discussed above is often considered an outlier diagnostic. It 
is customary to use MS E as an estimate of a 2 in computing r,. This is referred to as 
internal scaling of the residual because MS E is an internally generated estimate of cr 2 
obtained from fitting the model to all n observations. Another approach would be to use 
an estimate of cr 2 based on a data set with the ith observation removed. We denote the 
estimate of cr 2 so obtained by S 2 iy We can show that 


n2 (n - p)MS E - e 2 l(\ ~ h u ) 
5(0 ~ n-p- 1 


(10-52) 


The estimate of cr 2 in Equation 10-52 is used instead of MS E to produce an externally 
studentized residual, usually called R-student, given by 


’ VSg)(l - h id 


(10-53) 


In many situations, t, will differ little from the studentized residual r,. However, if 
the ith observation is influential, then S 2 } can differ significantly from MS E , and thus the 
R-student will be more sensitive to this point. Furthermore, under the standard assump¬ 
tions, t, has a t n - p -1 distribution. Thus R-student offers a more formal procedure for 
outlier detection via hypothesis testing. Table 10-3 displays the values of R-student for 
the viscosity regression model in Example 10-1. None of those values is unusually large. 
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10*7.2 Influence Diagnostics 

We occasionally find that a small subset of the data exerts a disproportionate influence 
on the fitted regression model. That is, parameter estimates or predictions may depend 
more on the influential subset than on the majority of the data. We would like to locate 
these influential points and assess their impact on the model. If these influential points 
are “bad” values, they should be eliminated. On the other hand, there may be nothing 
wrong with these points. But if they control key model properties, we would like to know 
it, because it could affect the use of the model. In this section we describe and illustrate 
some useful measures of influence. 

Leverage Points 

The disposition of points in x space is important in determining model properties. In 
particular, remote observations potentially have disproportionate leverage on the param¬ 
eter estimates, predicted values, and the usual summary statistics. 

The hat matrix H = X(X'X)”'X' is very useful in identifying influential observa¬ 
tions. As noted earlier, H determines the variances and covariances of y and e, because 
V(y) = <r 2 H and Vfe) = cr 2 (I - H). The elements /i, 7 of H may be interpreted as the 
amount of leverage exerted by y y on y,. Thus, inspection of the elements of H can reveal 
points that are potentially influential by virtue of their location in x space. Attention is 
usually focused on the diagonal elements ha. Because i ha = rank(H) = rank(X) = 
p, the average size of the diagonal element of the H matrix is pin. As a rough guideline, 
then, if a diagonal element /i„ is greater than 2 pin, observation i is a high-leverage point. 
To apply this to the viscosity model in Example 10-1, note that 2 p/n = 2(3)/16 = 0.375. 
Table 10-3 gives the hat diagonals /i„ for the first-order model; because none of the h a 
exceeds 0.375, we would conclude that there are no leverage points in these data. 

Influence on Regression Coefficients 

The hat diagonals will identify points that are potentially influential due to their location 
in x space. It is desirable to consider both the location of the point and the response 
variable in measuring influence. Cook (1977, 1979) has suggested using a measure of 
the squared distance between the least squares estimate based on all n points ji and the 
estimate obtained by deleting the / point, say jS (0 . This distance measure can be expressed 


(ftp - fl) f X'X(ft 0 - / 
pMS E 


(10-54) 


A reasonable cutoff for D, is unity. That is, we usually consider observations for which 
D, > 1 to be influential. 

The D , statistic is actually calculated from 


if V[y(x,)] r 2 h u 

p Vie?, pi 1 - hi,) 


i= 1, 2,..., n 


(10-55) 


Note that, apart from the constant p, D, is the product of the square of the ith studentized 
residual and h u li 1 - h it ). This ratio can be shown to be the distance from the vector x, 
to the centroid of the remaining data. Thus, D, is made up of a component that reflects 
how well the model fits the ith observation y, and a component that measures how far 
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that point is from the rest of the data. Either component (or both) may contribute to a 
large value of D,. 

Table 10-3 presents the values of £>, for the regression model fit to the viscosity data 
in Example 10-1. None of these values of D, exceeds 1, so there is no strong evidence 
of influential observations in these data. 


10-8 TESTING FOR LACK OF FIT 

In Section 6-6 we showed how adding center points to a 2 k factorial design allows the 
experimenter to obtain an estimate of pure experimental error. This allows the partitioning 
of the residual sum of squares SS E into two components; that is 

SS E — SS PE + SSlof 

where SS PE is the sum of squares due to pure error and SS LO f is the sum of squares due 
to lack of fit. 

We may give a general development of this partitioning in the context of a regression 
model. Suppose that we have n, observations on the response at the ith level of the 
regressors x,, i = 1,2,..., m. Let y i} denote the jth observation on the response at x„ 
i= 1, 2,..., m sndj = 1, 2,..., There are n = «, total observations. We may 

write the (y)th residual as 

ytj ~ Si = (yy - Si) + (.Si - yd (10-56) 

where Si is the average of the n, observations at x, . Squaring both sides of Equation 10- 
56 and summing over i and j yields 

£ £ (3 ’ij - Si) 2 = £ £ (y,j - Si) 2 + £ "iiSi - Si) 2 (10-57) 

1=1 j= 1 (=1 j— 1 1=1 

The left-hand side of Equation 10-57 is the usual residual sum of squares. The two 
components on the right-hand side measure pure error and lack of fit. We see that the 
pure error sum of squares 

SS PE = £ £ (y v ~ Si) 2 (10-58) 

i=i j= i 

is obtained by computing the corrected sum of squares of the repeat observations at each 
level of x and then pooling over the m levels of x. If the assumption of constant variance 
is satisfied, this is a model-independent measure of pure error because only the vari¬ 
ability of the y’s at each x, level is used to compute SS PE . Because there are n, - 1 
degrees of freedom for pure error at each level x„ the total number of degrees of freedom 
associated with the pure error sum of squares is 

2 («i “ 1) = n - m (10-59) 

The sum of squares for lack of fit 


SSlof = 2 «,(.}'/ - Sif 


( 10 - 60 ) 
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is a weighted sum of squared deviations between the mean response y, at each x, level 
and the corresponding fitted value. If the fitted values y t are close to the corresponding 
average responses y h then there is a strong indication that the regression function is linear. 
If the deviate greatly from the y„ then it is likely that the regression function is not 
linear. There are m - p degrees of freedom associated with SS LO f because there are m 
levels of x, and p degrees of freedom are lost because p parameters must be estimated 
for the model. Computationally we usually obtain SS LOF by subtracting SS PE from SS E . 
The test statistic for lack of fit is 

_ SS LOF /(rn — p ) _ MS lof (10-61) 

0 “ SS PE /(n - m) MS pe 

The expected value of MS PE is a 2 , and the expected value of MS LOF is 
S n, [w - J8 0 - 2 PjXv\ 

E(MS lof ) = o- 2 + —- m _ 2 — - (10 ' 62) 

If the true regression function is linear, then E(y,) = /3 0 + 2j=i fax 0 , and the second 
term of Equation 10-62 is zero, resulting in E(MS LOF ) = o- 2 . However, if the true re¬ 
gression function is not linear, then E(y,) ^ p 0 + PjXij, and E(MSlof ) > & ■ Fur¬ 

thermore, if the true regression function is linear, then the statistic F 0 follows the F m - P ,„~ m 
distribution. Therefore, to test for lack of fit, we would compute the test statistic F 0 and 
conclude that the regression function is not linear if F 0 > 

This test procedure may be easily incorporated into the analysis of variance. If we 
conclude that the regression function is not linear, then the tentative model must be 
abandoned and attempts made to find a more appropriate equation. Alternatively, if F 0 
does not exceed F a , m - p , n - m , there is no strong evidence of lack of fit and MS PE and MS LOF 
are often combined to estimate a 2 . Example 6-6 is a very complete illustration of this 
procedure, where the replicate runs are center points in a 2 2 factorial design. 

10-9 PROBLEMS- 

10-1 The tensile strength of a paper product is related to the amount of hardwood in the pulp. 
Ten samples are produced in the pilot plant, and the data obtained are shown in the 
following table. 


Strength 

Percent Hardwood 

Strength 

Percent Hardwood 

160 

10 

181 

20 

171 

15 

188 

25 

175 

15 

193 

25 

182 

20 

195 

28 

184 

20 

200 

30 


(a) Fit a linear regression model relating strength to percent hardwood. 

(b) Test the model in part (a) for significance of regression. 

(c) Find a 95 percent confidence interval on the parameter /3,. 

10-2. A plant distills liquid air to produce oxygen, nitrogen, and argon. The percentage of 
impurity in the oxygen is thought to be linearly related to the amount of impurities in 
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the air as measured by the “pollution count” in parts per million (ppm). A sample of 
plant operating data is shown below: 


Purity (%) 

1 93.3 

92.0 

92.4 

91.7 

94.0 

94.6 

93.6 

Pollution count (ppm) 

1 1.10 

1.45 

1.36 

1.59 

1.08 

0.75 

1.20 

93.1 

93.2 

92.9 

92.2 

91.3 

90.1 

91.6 

91.9 

0.99 

0.83 

1.22 

1.47 

1.81 

2.03 

1.75 

1.68 


(a) Fit a linear regression model to the data. 

(b) Test for significance of regression. 

(c) Find a 95 percent confidence interval on /3[. 

10-3. Plot the residuals from Problem 10-1 and comment on model adequacy. 

10-4. Plot the residuals from Problem 10-2 and comment on model adequacy. 

10-5. Using the results of Problem 10-1, test the regression model for lack of fit. 

10-6. A study was performed on wear of a bearing y and its relationship to x, = oil viscosity 
and x 2 = load. The following data were obtained: 



y 

Xi x 2 



193 

1.6 851 



230 

15.5 816 



172 

22.0 1058 



91 

43.0 1201 



113 

33.0 1357 



125 

40.0 1115 


(a) Fit a multiple linear regression model to the data. 

(b) Test for significance of regression. 

(c) Compute t statistics for each model parameter. What conclusions can you draw? 
The brake horsepower developed by an automobile engine on a dynomometer is thought 
to be a function of the engine speed in revolutions per minute (rpm), the road octane 
number of the fuel, and the engine compression. An experiment is run in the laboratory 
and the data that follow are collected: 

Brake Horsepo 1 

we r rpm 

Road Octane Number 

Compression 

225 

2000 

90 

100 

212 

1800 

94 

95 

229 

2400 

88 

110 

222 

1900 

91 

96 

219 

1600 

86 

100 

278 

2500 

96 

110 

246 

3000 

94 

98 

237 

3200 

90 

100 

233 

2800 

88 

105 

224 

3400 

86 

97 

223 

1800 

90 

100 

230 

2500 

89 

104 
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(a) Fit a multiple regression model to these data. 

(b) Test for significance of regression. What conclusions can you draw? 

(c) Based on t tests, do you need all three regressor variables in the model? 

10-8. Analyze the residuals from the regression model in Problem 10-7. Comment on model 
adequacy. 

10-9. The yield of a chemical process is related to the concentration of the reactant and 
the operating temperature. An experiment has been conducted with the following 
results. 


Yield 

Concentration 

Temperature 

81 

1.00 

150 

89 

1.00 

180 

83 

2.00 

150 

91 

2.00 

180 

79 

1.00 

150 

87 

1.00 

180 

84 

2.00 

150 

90 

2.00 

180 


(a) Suppose we wish to fit a main effects model to this data. Set up the X X matrix 
using the data exactly as it appears in the table. 

(b) Is the matrix you obtained in part (a) diagonal? Discuss your response. 

(c) Suppose we write our model in terms of the “usual” coded variables 

Cone —1.5 Temp - 165 

-55— * 2 = -I?— 

Set up the X'X matrix for the model in terms of these coded variables. Is this matrix 
diagonal? Discuss your response. 

(d) Define a new set of coded variables 

Cone — 1.0 _ Temp — 150 

Xl = L0 * 2 “ 30 

Set up the X'X matrix for the model in terms of this set of coded variables. Is this 
matrix diagonal? Discuss your response. 

(e) Summarize what you have learned from this problem about coding the variables. 
10-10. Consider the 2 4 factorial experiment in Example 6-2. Suppose that the last observation 

is missing. Reanalyze the data and draw conclusions. How do these conclusions compare 
with those from the original example? 

10-11. Consider the 2 4 factorial experiment in Example 6-2. Suppose that the last two obser¬ 
vations are missing. Reanalyze the data and draw conclusions. How do these conclusions 
compare with those from the original example? 

10-12. Given the following data, fit the second-order polynomial regression model 


y = p 0 + /3|JC] + P2X2 + )3 u Xi + /3 2 2*2 + P\2 x i x 2 + e 
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_y _ *1 _ xi_ 

26 1.0 1.0 

24 1.0 1.0 

175 1.5 4.0 

160 1.5 4.0 

163 1.5 4.0 

55 0.5 2.0 

62 1.5 2.0 

100 0.5 3.0 

26 1.0 1.5 

30 0.5 1.5 

70 1.0 2.5 

71 0.5 2.5 


After you have fit the model, test for significance of regression. 

10-13. (a) Consider the quadratic regression model from Problem 10-12. Compute t statistics 

for each model parameter and comment on the conclusions that follow from these 
quantities. 

(b) Use the extra sum of squares method to evaluate the value of the quadratic terms 
x\, x\, and x,x 2 to the model. 

10-14. Relationship between analysis of variance and regression. Any analysis of variance 
model can be expressed in terms of the general linear model y = x/3 + e, where the X 
matrix consists of 0s and Is. Show that the single-factor model y u = p + t, + e y , i = 
1, 2, 3, j = 1, 2, 3,4 can be written in general linear model form. Then 

(a) Write the normal equations (X’X)fi = X'y and compare them with the normal 
equations found for this model in Chapter 3. 

(b) Find the rank of XX. Can (X'X)~ 1 be obtained? 

(c) Suppose the first normal equation is deleted and the restriction nf, = 0 is 
added. Can the resulting system of equations be solved? If so, find the solution. 
Find the regression sum of squares fi'X'y, and compare it to the treatment sum of 
squares in the single-factor model. 

10-15. Suppose that we are fitting a straight line and we desire to make the variance of fit as 
small as possible. Restricting ourselves to an even number of experimental points, where 
should we place these points so as to minimize Vlfifil [Note: Use the design called for 
in this exercise with great caution because, even though it minimizes l/f/S,), it has some 
undesirable properties; for example, see Myers and Montgomery (1995). Only if you are 
very sure the true functional relationship is linear should you consider using this design.] 

10-16. Weighted least squares. Suppose that we are fitting the straight line y = / 3 0 + fiiX + e, 
but the variance of the y’s now depends on the level of x; that is, 

V(y I*,) = of = i = 1, 2,..., n 

where the w, are known constants, often called weights. Show that if we choose estimates 
of the regression coefficients to minimize the weighted sum of squared errors given by 
2 w,(y, ~ Po ~ PiX,) 2 , the resulting least squares normal equations are 

fio E w >t + ^ E w, Xi = E w ty, 
fio E wiXi + fit E w,xf = E w, 
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10-17. Consider the 2? v ' design discussed in Example 10-5. 

(a) Suppose you elect to augment the design with the single run selected in that ex¬ 
ample. Find the variances and covariances of the regression coefficients in the 
model (ignoring blocks): 

y = Po + l3,x, + p2X 2 + 183X3 + 184X4 

+ Pi2 x i x 2 + ^34X3X4 + e 

(b) Are there any other runs in the alternate fraction that would de-alias AB from CD? 

(c) Suppose you augment the design with the four runs suggested in Example 10-5. 
Find the variances and covariances of the regression coefficients (ignoring blocks) 
for the model in part (a). 

(d) Considering parts (a) and (c), which augmentation strategy would you prefer, and 
why? 

10-18. Consider a 2Ju 4 design. Suppose after running the experiment, the largest observed ef¬ 
fects are A + BD, B + AD, and D + AB. You wish to augment the original design with 

a group of four runs to de-alias these effects. 

(a) Which four runs would you make? 

(b) Find the variances and covariances of the regression coefficients in the model 

y = 18 0 + Pi x \ + i 3 2 x 2 + 184X4 + 1812X1X2 + / 3 ]4 x 1 x 4 

+ 1824X2X4 + e. 

(c) Is it possible to de-alias these effects with fewer than four additional runs? 


Response Surface 
Methods and 
Other Approaches 
to Process 
Optimization 


11-1 INTRODUCTION TO RESPONSE SURFACE METHODOLOGY 

Response surface methodology, or RSM, is a collection of mathematical and statistical 
techniques that are useful for the modeling and analysis of problems in which a response 
of interest is influenced by several variables and the objective is to optimize this response. 
For example, suppose that a chemical engineer wishes to find the levels of temperature 
(*i) and pressure (x 2 ) that maximize the yield (y) of a process. The process yield is a 
function of the levels of temperature and pressure, say 

y = f( x u x 2 ) + e 

where e represents the noise or error observed in the response y. If we denote the expected 
response by E(y) = f(x u x 2 ) = ij, then the surface represented by 

V = /(*i, x 2 ) 


is called a response surface. 

We usually represent the response surface graphically, such as in Figure 11-1 (on 
the next page), where 17 is plotted versus the levels of jc, and x 2 . We have seen response 
surface plots such as this before, particularly in the chapters on factorial designs. To help 
visualize the shape of a response surface, we often plot the contours of the response 
surface as shown in Figure 11-2 (on the next page). In the contour plot, lines of constant 
response are drawn in the x u x 2 plane. Each contour corresponds to a particular height 
of the response surface. We have also previously seen the utility of contour plots. 

In most RSM problems, the form of the relationship between the response and the 
independent variables is unknown. Thus, the first step in RSM is to find a suitable ap¬ 
proximation for the true functional relationship between y and the set of independent 
variables. Usually, a low-order polynomial in some region of the independent variables 
is employed. If the response is well modeled by a linear function of the independent 
variables, then the approximating function is the first-order model 


y = £o + 01*1 + P 2 X 2 + • • • + p k x k + € 


( 11 - 1 ) 
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Figure 11-1 A three-dimensional response surface showing the expected yield (rj) 
as a function of temperature (x,) and pressure (x 2 ). 



Figure 11-2 A contour plot of a response surface. 
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If there is curvature in the system, then a polynomial of higher degree must be used, 
such as the second-order model 

y = Po + 2 P>x, + E p„xf + EE PijXiXj + € (11-2) 

Almost all RSM problems use one or both of these models. Of course, it is unlikely that 
a polynomial model will be a reasonable approximation of the true functional relationship 
over the entire space of the independent variables, but for a relatively small region they 
usually work quite well. 

The method of least squares, discussed in Chapter 10, is used to estimate the parameters 
in the approximating polynomials. The response surface analysis is then performed using 
the fitted surface. If the fitted surface is an adequate approximation of the true response 
function, then analysis of the fitted surface will be approximately equivalent to analysis 
of the actual system. The model parameters can be estimated most effectively if proper 
experimental designs are used to collect the data. Designs for fitting response surfaces are 
called response surface designs. These designs are discussed in Section 11-4. 

RSM is a sequential procedure. Often, when we are at a point on the response 
surface that is remote from the optimum, such as the current operating conditions in 
Figure 11-3, there is little curvature in the system and the first-order model will be 
appropriate. Our objective here is to lead the experimenter rapidly and efficiently along 
a path of improvement toward the general vicinity of the optimum. Once the region of 
the optimum has been found, a more elaborate model, such as the second-order model, 
may be employed, and an analysis may be performed to locate the optimum. From Figure 
11-3, we see that the analysis of a response surface can be thought of as “climbing a 
hill,” where the top of the hill represents the point of maximum response. If the true 
optimum is a point of minimum response, then we may think of “descending into a 
valley. ’ ’ 

The eventual objective of RSM is to determine the optimum operating conditions 
for the system or to determine a region of the factor space in which operating require- 
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ments are satisfied. More extensive presentations of RSM are in Myers and Montgomery 
(1995), Khuri and Cornell (1996), and Box and Draper (1987). 


11 *2 THE METHOD OF STEEPEST ASCENT 

Frequently, the initial estimate of the optimum operating conditions for the system will 
be far from the actual optimum. In such circumstances, the objective of the experimenter 
is to move rapidly to the general vicinity of the optimum. We wish to use a simple and 
economically efficient experimental procedure. When we are remote from the optimum, 
we usually assume that a first-order model is an adequate approximation to the true 
surface in a small region of the x's. 

The method of steepest ascent is a procedure for moving sequentially along the 
path of steepest ascent, that is, in the direction of the maximum increase in the response. 
Of course, if minimization is desired, then we call this technique the method of steepest 
descent. The fitted first-order model is 

y = Po + 2 ( 11 - 3 ) 

and the first-order response surface, that is, the contours of y, is a series of parallel lines 
such as that shown in Figure 11-4. The direction of steepest ascent is the direction in 
which $ increases most rapidly. This direction is parallel to the normal to the fitted 
response surface. We usually take as the path of steepest ascent the line through the 
center of the region of interest and normal to the fitted surface. Thus, the steps along the 



Figure 11-4 First-order response surface and path of steepest ascent. 
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path are proportional to the regression coefficients {/3,}. The actual step size is determined 
by the experimenter based on process knowledge or other practical considerations. 

Experiments are conducted along the path of steepest ascent until no further increase 
in response is observed. Then a new first-order model may be fit, a new path of steepest 
ascent determined, and the procedure continued. Eventually, the experimenter will arrive 
in the vicinity of the optimum. This is usually indicated by lack of fit of a first-order 
model. At that time additional experiments are conducted to obtain a more precise esti¬ 
mate of the optimum. 


EXAMPLE 11-1 . 

A chemical engineer is interested in determining the operating conditions that maximize 
the yield of a process. Two controllable variables influence process yield: reaction time 
and reaction temperature. The engineer is currently operating the process with a reaction 
time of 35 minutes and a temperature of 155°F, which result in yields of around 40 
percent. Because it is unlikely that this region contains the optimum, she fits a first-order 
model and applies the method of steepest ascent. 

The engineer decides that the region of exploration for fitting the first-order model 
should be (30,40) minutes of reaction time and (150,160)°F. To simplify the calculations, 
the independent variables will be coded to the usual (—1, 1) interval. Thus, if denotes 
the natural variable time and £ 2 denotes the natural variable temperature, then the 
coded variables are 


Xi 


6 ~ 35 
5 


and 


The experimental design is shown in Table 11-1. Note that the design used to collect 
this data is a 2 2 factorial augmented by five center points. Replicates at the center are 
used to estimate the experimental error and to allow for checking the adequacy of the 
first-order model. Also, the design is centered about the current operating conditions for 
the process. 

A first-order model may be fit to these data by least squares. Following the methods 
for two-level designs, we obtain the following model in the coded variables: 

y = 40.44 + 0.775*, + 0.325x 2 


Table 11-1 Process Data for Fitting the First- 
Order Model 


Natural 

Variables 

fi_ 

30 150 

30 160 

40 150 

40 160 

35 155 

35 155 

35 155 

35 155 

35 155 


Coded 

Variables 


0 

0 

0 

0 

0 


Response 

y 

39.3 
40.0 
40.9 

41.5 

40.3 

40.5 
40.7 
40.2 

40.6 


-1 
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Before exploring along the path of steepest ascent, the adequacy of the first-order 
model should be investigated. The 2 2 design with center points allows the experimenter 
to 


1. Obtain an estimate of error 

2. Check for interactions (cross-product terms) in the model 

3. Check for quadratic effects (curvature) 

The replicates at the center can be used to calculate an estimate of error as follows: 
_ (40.3) 2 + (40.5) 2 + (40.7) 2 + (40.2) 2 + (40.6) 2 - (202.3) 2 /5 


The first-order model assumes that the variables x } and x 2 have an additive effect on the 
response. Interaction between the variables would be represented by the coefficient f} l2 
of a cross-product term x t x 2 added to the model. The least squares estimate of this 
coefficient is just one-half the interaction effect calculated as in an ordinary 2 2 factorial 
design, or 

0,2 = i[(l X 39.3) + (1 X 41.5) + (-1 X 40.0) + (-1 X 40.9)] 

= K-o.i) 

= -0.025 

The single degree of freedom sum of squares for interaction is 



= 0.0025 


Comparing SS| nteraction to & 2 gives a lack-of-fit statistic 

r, ^interaction 

F J 2 

= 0.0025 
“ 0.0430 
= 0.058 

which is small, indicating that interaction is negligible. 

Another check of the adequacy of the straight-line model is obtained by applying 
the check for pure quadratic curvature effect described in Section 6-6. Recall that this 
consists of comparing the average response at the four points in the factorial portion of 
the design, say y F = 40.425, with the average response at the design center, say y c = 
40.46. If there is quadratic curvature in the true response function, then y F — y c is a 
measure of this curvature. If /3 U and f} 22 are the coefficients of the “pure quadratic” 
terms x\ and x\, then y F - y c is an estimate of /3 U + p 22 . In our example, an estimate 
of the pure quadratic term is 

j 8 n + 022 = y F -y c 

= 40.425 - 40.46 
= -0.035 
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Table 11-2 Analysis of Variance for the First-Order Model 


Source of Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

ft 

ft Value 

Model (ft, ft) 

2.8250 

2 

1.4125 

47.83 

0.0002 

Residual 

0.1772 

6 




(Interaction) 

(0.0025) 

1 

0.0025 

0.058 

0.8215 

(Pure quadratic) 

(0.0027) 

1 

0.0027 

0.063 

0.8142 

(Pure error) 

(0.1720) 

4 

0.0430 



Total 

3.0022 

8 





The single-degree-of-freedom sum of squares associated with the null hypothesis, 
//„:/3n +/3 22 = 0, is 

™ _ n F n c (y F - y c f 

‘J^Pure Quadratic . 

n F + n c 

_ (4)(5)(—0.035) 2 
4 + 5 
= 0.0027 

where n F and n c are the number of points in the factorial portion and the number of 
center points, respectively. Because 

SSpure Quadratic 

F ~ 

= 0.0027 
“ 0.0430 
= 0.063 

is small, there is no indication of a pure quadratic effect. 

The analysis of variance for this model is summarized in Table 11-2. Both the in¬ 
teraction and curvature checks are not significant, whereas the F test for the overall 
regression is significant. Furthermore, the standard error of /ft and /ft is 



Both regression coefficients /ft and /ft are large relative to their standard errors. At this 
point, we have no reason to question the adequacy of the first-order model. 

To move away from the design center—the point (jc, = 0, x 2 = 0)—along the path 
of steepest ascent, we would move 0.775 units in the jc, direction for every 0.325 units 
in the x 2 direction. Thus, the path of steepest ascent passes through the point (x, = 0, 
*2 = 0) and has a slope 0.325/0.775. The engineer decides to use 5 minutes of reaction 
time as the basic step size. Using the relationship between and x,, we see that 5 minutes 
of reaction time is equivalent to a step in the coded variable x, of Ax, = 1. Therefore, 
the steps along the path of steepest ascent are Ax, = 1.0000 and Ax 2 = (0.325/0.775) 
Ax, = 0.42. 

The engineer computes points along this path and observes the yields at these points 
until a decrease in response is noted. The results are shown in Table 11-3 (on the next 
page) in both coded and natural variables. Although the coded variables are easier to 
manipulate mathematically, the natural variables must be used in running the process. 
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Table 11-3 Steepest Ascent Experiment for Example 11-1 


Steps 

Coded Variables 

Natural Variables 

Response 

y 

*1 

*2 

ft 

ft 

Origin 

0 

0 

35 

155 


A 

1.00 

0.42 

5 

2 


Origin + A 

1.00 

0.42 

40 

157 

41.0 

Origin + 2A 

2.00 

0.84 

45 

159 

42.9 

Origin + 3 A 

3.00 

1.26 

50 

161 

47.1 

Origin + 4A 

4.00 

1.68 

55 

163 

49.7 

Origin + 5 A 

5.00 

2.10 

60 

165 

53.8 

Origin + 6A 

6.00 

2.52 

65 

167 

59.9 

Origin + 7 A 

7.00 

2.94 

70 

169 

65.0 

Origin + 8A 

8.00 

3.36 

75 

171 

70.4 

Origin + 9A 

9.00 

3.78 

80 

173 

77.6 

Origin + 10A 

10.00 

4.20 

85 

175 

80.3 

Origin + 11A 

11.00 

4.62 

90 

179 

76.2 

Origin + 12A 

12.00 

5.04 

95 

181 

75.1 


Figure 11-5 plots the yield at each step along the path of steepest ascent. Increases in 
response are observed through the tenth step; however, all steps beyond this point result 
in a decrease in yield. Therefore, another first-order model should be fit in the general 
vicinity of the point (& = 85, & = 175). 

A new first-order model is fit around the point (& = 85, £2 = 175). The region of 
exploration for & is [80, 90], and for ft, it is [170, 180]. Thus, the coded variables are 


& ~ 175 
5 



Figure 11-5 Yield versus steps along the path of steepest ascent 
for Example 11-1. 
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Table 11-4 Data for Second First-Order Model 


Natural 

Variables 

Coded 

Variables 

Response 

y 


& 

X, 

*2 

80 

170 

-1 

-1 

76.5 

80 

180 

-1 

1 

77.0 

90 

170 

1 

-1 

78.0 

90 

180 

1 

1 

79.5 

85 

175 

0 

0 

79.9 

85 

175 

0 

0 

80.3 

85 

175 

0 

0 

80.0 

85 

175 

0 

0 

79.7 

85 

175 

0 

0 

79.8 


Once again, a 2 2 design with five center points is used. The experimental design is shown 
in Table 11-4. 

The first-order model fit to the coded variables in Table 11-4 is 
y = 78.97 + l.OOxi + 0.50x 2 

The analysis of variance for this model, including the interaction and pure quadratic term 
checks, is shown in Table 11-5. The interaction and pure quadratic checks imply that the 
first-order model is not an adequate approximation. This curvature in the true surface 
may indicate that we are near the optimum. At this point, additional analysis must be 
done to locate the optimum more precisely. 


From Example 11-1 we notice that the path of steepest ascent is proportional to the 
signs and magnitudes of the regression coefficients in the fitted first-order model 

9 = fro + It & x i 

It is easy to give a general algorithm for determining the coordinates of a point on the 
path of steepest ascent. Assume that the point Xj = x 2 = ... = x k = 0 is the base or 
origin point. Then 

1. Choose a step size in one of the process variables, say Ax,. Usually, we would 
select the variable we know the most about, or we would select the variable that 
has the largest absolute regression coefficient | j8, |. 


Table 11-5 Analysis of Variance for the Second First-Order Model 
Sum of Degrees of Mean 

Source of Variation Squares Freedom Square 


Regression 
Residual 
(Interaction) 
(Pure quadratic) 
(Pure error) 
Total 


5.00 2 

11.1200 6 

(0.2500) 

(10.6580) 

( 0 . 2120 ) 

16.1200 8 


1 0.2500 

1 10.6580 

4 0.0530 


4.72 

201.09 


P-Value 


0.0955 

0.0001 
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2. The step size in the other variables is 


Ax, = -s 77 i = 1,2,, k\ 
pj/Axj 


i*j 


3. Convert the Ax, from coded variables to the natural variables. 


To illustrate, consider the path of steepest ascent computed in Example 11-1. Because 
X\ has the largest regression coefficient, we select reaction time as the variable in step 1 
of the above procedure. Five minutes of reaction time is the step size (based on process 
knowledge). In terms of the coded variables, this is Ax, = 1.0. Therefore, from guideline 
2, the step size in temperature is 


Ax 2 


_A_ 

A/Ax, 


0.325 

(0.775/1.0) 


0.42 


To convert the coded step sizes (Ax, = 1.0 and Ax 2 = 0.42) to the natural units of time 
and temperature, we use the relationships 


Ax, 


A£, 

5 


and Ax 2 


5 


which results in 

A£i = Ax, (5) = 1.0(5) = 5 min 


A& = Ax 2 (5) = 0.42(5) = 2°F 

11 -3 ANALYSIS OF A SECOND-ORDER RESPONSE SURFACE 

When the experimenter is relatively close to the optimum, a model that incorporates 
curvature is usually required to approximate the response. In most cases, the second- 
order model 

y = I A, + E Ax, + E A t xf + EE A i**t + e (11-4) 

is adequate. In this section we will show how to use this fitted model to find the optimum 
set of operating conditions for the x’s and to characterize the nature of the response 
surface. 


11-3.1 Location of the Stationary Point 

Suppose we wish to find the levels of x,, x 2 ,..., x k that optimize the predicted response. 
This point, if it exists, will be the set of x,, x 2 ,..., x k for which the partial derivatives 
dy!dx x = dy/dx 2 = • ■ ■ = dy/dx k = 0. This point, say x l s , x 2 s ,..., x* jS , is called the 
stationary point. The stationary point could represent (1) a point of maximum response, 
(2) a point of minimum response, or (3) a saddle point. These three possibilities are 
shown in Figures 11-6, 11-7 (page 438), and 11-8 (page 439). 

Contour plots play a very important role in the study of the response surface. By 
generating contour plots using computer software for response surface analysis, the ex- 
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perimenter can usually characterize the shape of the surface and locate the optimum with 
reasonable precision. 

We may obtain a general mathematical solution for the location of the stationary 
point. Writing the second-order model in matrix notation, we have 

$ = p 0 + x'b + x'Bx (11-5) 

where 



X, 


A 


A, 012/2, .. • , Pik/2~ 

X = 

*2 

b = 

& 

and B = 

022. ■ • ■ , 02t/2 


_X k _ 


A 


sym. /I** 


That is, b is a (k X 1) vector of the first-order regression coefficients and B is a (k X k) 
symmetric matrix whose main diagonal elements are the pure quadratic coefficients 
(0„) and whose off-diagonal elements are one-half the mixed quadratic coefficients A 
i # j). The derivative of y with respect to the elements of the vector x equated to 0 is 

^ = b + 2Bx = 0 (11-6) 

dx 

The stationary point is the solution to Equation 11-6, or 

x s = -iB-'b (11-7) 

Furthermore, by substituting Equation 11-7 into Equation 11-5, we can find the predicted 
response at the stationary point as 

& = jS 0 + |x'b (11-8) 


11-3.2 Characterizing the Response Surface 

Once we have found the stationary point, it is usually necessary to characterize the 
response surface in the immediate vicinity of this point. By characterize, we mean 
determine whether the stationary point is a point of maximum or minimum response or 
a saddle point. We also usually want to study the relative sensitivity of the response to 
the variables x u x 2 ,..., x k . 

As we mentioned previously, the most straightforward way to do this is to examine 
a contour plot of the fitted model. If there are only two or three process variables (the 
x’s), the construction and interpretation of this contour plot is relatively easy. However, 
even when there are relatively few variables, a more formal analysis, called the canonical 
analysis, can be useful. 

It is helpful first to transform the model into a new coordinate system with the origin 
at the stationary point x s and then to rotate the axes of this system until they are parallel 
to the principal axes of the fitted response surface. This transformation is illustrated in 
Figure 11-9 on the facing page. We can show that this results in the fitted model 

y = y s + A ,w? + A 2 w| + • • • + A k w 2 k (11-9) 

where the { w ,) are the transformed independent variables and the {A,} are constants. 
Equation 11-9 is called the canonical form of the model. Furthermore, the {A,} are just 
the eigenvalues or characteristic roots of the matrix B. 


11-3 ANALYSIS OF A SECOND-ORDER RESPONSE SURFACE 


441 



Figure 11-9 Canonical form of the second-order model. 


The nature of the response surface can be determined from the stationary point and 
the signs and magnitudes of the {A,}. First, suppose that the stationary point is within 
the region of exploration for fitting the second-order model. If the {A,} are all positive, 
x s is a point of minimum response; if the {A,} are all negative, x s is a point of maximum 
response; and if the {A,} have different signs, x s is a saddle point. Furthermore, the surface 
is steepest in the w, direction for which |A ; | is the greatest. For example, Figure 11-9 
depicts a system for which x s is a maximum (A, and A 2 are negative) with | A L | > | A 2 1. 

EXAMPLE 11-2 . 

We will continue the analysis of the chemical process in Example 11-1. A second-order 
model in the variables x t and x 2 cannot be fit using the design in Table 11-4. The ex¬ 
perimenter decides to augment this design with enough points to fit a second-order 
model. 1 She obtains four observations at (jq = 0, x 2 = ±1.414) and (jq = ±1.414, 
x 2 = 0). The complete experiment is shown in Table 11-6 (page 442), and the design is 
displayed in Figure 11-10 (on the next page). This design is called a central composite 
design (or a CCD) and will be discussed in more detail in Section 11-4.2. In this second 
phase of the study, two additional responses were of interest, the viscosity and the mo¬ 
lecular weight of the product. The responses are also shown in Table 11-6. 

We will focus on fitting a quadratic model to the yield response y x (the other re¬ 
sponses will be discussed in Section 11-3.4). We generally use computer software to fit 
a response surface and to construct the contour plots. Table 11-7 (pages 443-444) contains 
the output from Design-Expert. From examining this table we notice that this software 
package first computes the ‘ ‘ sequential or extra sums of squares ’ ’ for the linear, quadratic, 
and cubic terms in the model (there is a warning message concerning aliasing in the 
cubic model because the CCD does not contain enough runs to support a full cubic 


1 The engineer ran the additional four observations at about the same time she ran the original nine observations. If 
substantial time had elapsed between the two sets of runs, blocking would have been necessary. Blocking in response 
surface designs is discussed in Section 11-4.3. 
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Table 11-6 Central Composite Design for Example 11-2 


Natural 

Variables 

Coded Variables 


Response: 


6 

& 

X \ 

*2 

Vi (yield) 

y 2 (viscosity) 

y 3 (molecular weight) 

80 

170 

-1 

-1 

76.5 

62 

2940 

80 

180 

-1 

1 

77.0 

60 

3470 

90 

170 

1 

-1 

78.0 

66 

3680 

90 

180 

1 

1 

79.5 

59 

3890 

85 

175 

0 

0 

79.9 

72 

3480 

85 

175 

0 

0 

80.3 

69 

3200 

85 

175 

0 

0 

80.0 

68 

3410 

85 

175 

0 

0 

79.7 

70 

3290 

85 

175 

0 

0 

79.8 

71 

3500 

92.07 

175 

1.414 

0 

78.4 

68 

3360 

77.93 

175 

-1.414 

0 

75.6 

71 

3020 

85 

182.07 

0 

1.414 

78.5 

58 

3630 

85 

167.93 

0 

-1.414 

77.0 

57 

3150 


model). Based on the small P-value for the quadratic terms, we decided to fit the second- 
order model to the yield response. The computer output shows the final model in terms 
of both the coded variables and the natural or actual factor levels. 

Figure 11-11 (page 445) shows the three-dimensional response surface plot and the 
contour plot for the yield response in terms of the process variables time and temperature. 
It is relatively easy to see from examining these figures that the optimum is very near 
175°F and 85 minutes of reaction time and that the response is at a maximum at this 
point. From examination of the contour plot, we note that the process may be slightly 
more sensitive to changes in reaction time than to changes in temperature. 


<0,1.414) 


(-1.1) 


(-1.414, 0) I | (0,0) 


(- 1 ,- 1 ) 


( 1 , 1 ) 


(1.414,0) +2 


( 1 ,- 1 ) 


x^ 


>(0,-1.414) 


Figure 11-10 Central composite design for Example 11-2. 
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Table 11-7 Computer Output from Design-Expert for Fitting a Model to the Yield Response in 
Example 11-2 

Response: yield 

•••WARNING: The Cubic Model is Aliased!*** 


Sequential Model Sum of Squares 



Sum of 


Mean 

F 



Source 

Squares 

DF 

Square 

Value 

Prob > F 


Mean 

80062.16 

1 

80062.16 




Linear 

10.04 

2 

5.02 

2.69 

0.1166 


2FI 

0.25 

1 

0.25 

0.12 

0.7350 


Quadratic 

17.95 

2 

8.98 

126.88 

<0.0001 

Suggested 

Cubic 

2.042E-003 

2 

1.021E-003 

0.010 

0.9897 

Aliased 

Residual 

0.49 

5 

0.099 




Total 

80090.90 

13 

6160.84 




"Sequential Model Sum of Squares?': 

: Select the highest order 

polynomial where the 

additional terms are significant. 





Lack of Fit Tests 







Sum of 


Mean 

F 



Source 

Squares 

DF 

Square 

Value 

Prob > F 


Linear 

18.49 

6 

3.08 

58.14 

0.0008 


2FI 

18.24 

5 

3.65 

68.82 

0.0006 


Quadratic 

0.28 

3 

0.094 

1.78 

0.2897 

Suggested 

Cubic 

0.28 

1 

0.28 

5.31 

0.0826 

Aliased 

Pure Error 

0.21 

4 

0.053 





“Lack of Fit Tests": Want the selected model to have insignificant lack-of-fit. 

Model Summary Statistics 

Std. Adjusted Predicted 

Source Dev. R-Squared R-Squared R-Squared PRESS 

Linear 1.37 0.3494 0.2193 -0.0435 29.99 

2FI 1.43 0.3581 0.1441 -0.2730 36.59 

Quadratic 0.27 0.9828 0.9705 0.9184 2.35 Suggested 

Cubic 0.31 0.9828 0.9588 0.3622 18.33 Aliased 

“Model Summary Statistics Focus on the model minimizing the "PRESS", or 
equivalently maximizing the "PRED R-SQR". 

Response: yield 

ANOVA for Response Surface Quadratic Model 
Analysis of variance table [Partial sum of squares] 

Sum of Mean F 

Source Squares DF Square Value Prob > F 

Model 28.25 5 5.65 79.85 <0.0001 

A 7.92 1 7.92 111.93 <0.0001 

B 2.12 1 2.12 30.01 0.0009 

A 2 13.18 1 13.18 186.22 <0.0001 

B 2 6.97 1 6.97 98.56 <0.0001 

AB 0.25 1 0.25 3.53 0.1022 

Residual 0.50 7 0.071 

Lack of Fit 0.28 3 0.094 1.78 0.2897 

Pure Error 0.21 4 0.053 

Cor Total 28.74 12 

Std. Dev. 0.27 R-Squared 0.9828 

Mean 78.48 Adj R-Squared 0.9705 

0.34 Pred R-Squared 0.9184 

2.35 Adeq Precision 23.018 


C.V. 

PRESS 
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Table 11*7 (continued) 

Coefficient Standard 

Factor Estimate DF Error 

Intercept 79.94 1 0.12 

A-time 0.99 1 0.094 

B-temp 0.52 1 0.094 

A 2 -1.38 1 0.10 

B 2 -1.00 1 0.10 

AB 0.25 1 0.13 

Final Equation in Terms of Coded Factors: 

yield = 

+79.94 
+0.99 * A 
+0.52 * B 
-1.38 * A 2 
-1.00 * B 2 
+0.25 * A * B 

Final Equation in Terms of Actual Factors: 

yield = 

-1430.52285 

+7.80749 * time 
+ 13.27053 * temp 
-0.055050 * time 2 
-0.040050 * temp 2 
+0.010000 * time * temp 

Diagnostics Case Statistics 
Run Standard Actual Predicted Student Cook's Outlier 

Order Order Value Value Residual Leverage Residual Distance t 

8 1 76.50 76.30 0.20 0.625 1.213 0.409 1.264 

6 2 78.00 77.79 0.21 0.625 1.275 0.452 1.347 

9 3 77.00 76.83 0.17 0.625 1.027 0.293 1.032 

11 4 79.50 79.32 0.18 0.625 1.089 0.329 1.106 

12 5 75.60 75.78 -0.18 0.625 -1.107 0.341 -1.129 

10 6 78.40 78.59 -0.19 0.625 -1.195 0.396 -1.240 

7 7 77.00 77.21 -0.21 0.625 -1.283 0.457 -1.358 

1 8 78.50 78.67 -0.17 0.625 -1.019 0.289 -1.023 

5 9 79.90 79.94 -0.040 0.200 -0.168 0.001 -0.156 

3 10 80.30 79.94 0.36 0.200 1.513 0.095 1.708 

13 11 80.00 79.94 0.060 0.200 0.252 0.003 0.235 

2 12 79.70 79.94 -0.24 0.200 -1.009 0.042 -1.010 

4 13 79.80 79.94 -0.14 0.200 -0.588 0.014 -0.559 


We could also find the location of the stationary point using the general solution in 
Equation 11-7. Note that 

0.995] [-1.376 0.1250] 

0.515 J B [ 0.1250 -1.001 J 

and from Equation 11-7 the stationary point is 
x s = -^B^b 

-0.7345 -0.0917] [0.995] = [0.389] 

-0.0917 -1.0096] [o.515 J [o.306j 



95% Cl 95% Cl 

Low High VIF 

79.66 80.22 

0.77 1.22 1.00 

0.29 0.74 1.00 

-1.61 -1.14 1.02 

-1.24 -0.76 1.02 

-0.064 0.56 1.00 
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That is, x u = 0.389 and x 2 ,s = 0.306. In terms of the natural variables, the stationary 
point is 


which yields & = 86.95 — 87 minutes of reaction time and £ 2 = 176.53 — 176.5°F. 
This is very close to the stationary point found by visual examination of the contour plot 
in Figure 11-11. Using Equation 11-8, we may find the predicted response at the sta¬ 
tionary point as y s = 80.21. 
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We may also use the canonical analysis described in this section to characterize the 
response surface. First, it is necessary to express the fitted model in canonical form 
(Equation 11-9). The eigenvalues A, and A 2 are the roots of the determinantal equation 

|B - AI| = 0 
1—1.376 — A 0.1250 I 
| 0.1250 -1.001 -A | 

which reduces to 

A 2 + 2.3788A + 1.3639 = 0 

The roots of this quadratic equation are A, = —0.9641 and A 2 = —1.4147. Thus, the 
canonical form of the fitted model is 

y = 80.21 - 0.964M - 1.4147w| 

Because both A, and A 2 are negative and the stationary point is within the region of 
exploration, we conclude that the stationary point is a maximum. 


In some RSM problems it may be necessary to find the relationship between the 
canonical variables {w,} and the design variables {*,}. This is particularly true if it is 
impossible to operate the process at the stationary point. As an illustration, suppose that 
in Example 11-2 we could not operate the process at = 87 minutes and & = 176.5°F 
because this combination of factors results in excessive cost. We now wish to “back 
away” from the stationary point to a point of lower cost without incurring large losses 
in yield. The canonical form of the model indicates that the surface is less sensitive to 
yield loss in the w, direction. Exploration of the canonical form requires converting points 
in the (w,, w 2 ) space to points in the (*,, x 2 ) space. 

In general, the variables x are related to the canonical variables w by 

w = M'(x - Xs) 

where M is a (k X k) orthogonal matrix. The columns of M are the normalized eigen¬ 
vectors associated with the (A,). That is, if m, is the ith column of M, then m, is the 
solution to 

(B - A, I)m, = 0 (11-10) 

for which 2* =1 m 2 = 1. 

We illustrate the procedure using the fitted second-order model in Example 11-2. 
For A, = -0.9641, Equation 11-10 becomes 

[(-1.376 + 0.9641) 0.1250 ir^nl^ 0 ] 

[ 0.1250 (-1.001 + 0.9641) |_0J 

or 

-0.4129m„ + 0.1250m 21 = 0 
0.1250m,, - 0.0377m 21 = 0 

We wish to obtain the normalized solution to these equations, that is, the one for which 
mf i + mh = 1- There is no unique solution to these equations, so it is most convenient 
to assign an arbitrary value to one unknown, solve the system, and normalize the solution. 
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Letting m* — 1, we find m* x = 0.3027. To normalize this solution, we divide 
m*i by 

V(m* ) 2 + (mf,) 2 = V(0.3027) 2 + (l) 2 = 1.0448 
This yields the normalized solution 


0.3027 
' 1.0448 ' 


and 


and 


1.0448 1.0448 


which is the first column of the M matrix. 

Using A 2 = -1.4147, we can repeat the above procedure, obtaining m l2 - -0.9574 
and m 12 = 0.2888 as the second column of M. Thus, we have 


M = 


0.2897 

0.9571 


-0.9574" 

0.2888 


The relationship between the w and x variables is 

Mr 0.2897 0.9571] U - 0.389] 
[w 2 J [-0.9574 0.2888] [x 2 - 0.306 J 


or 


w, = 0.28970c, - 0.389) + 0.9571 (jc 2 - 0.306) 
w 2 = —0.95740c, - 0.389) + 0.2888 (jc 2 - 0.306) 

If we wished to explore the response surface in the vicinity of the stationary point, we 
could determine appropriate points at which to take observations in the (w,, w 2 ) space 
and then use the above relationship to convert these points into the (jc,, jc 2 ) space so that 
the runs may be made. 


11-3.3 Ridge Systems 

It is not unusual to encounter variations of the pure maximum, minimum, or saddle point 
response surfaces discussed in the previous section. Ridge systems, in particular, are 
fairly common. Consider the canonical form of the second-order model given previously 
in Equation 11-9: 

y = y s + A x wf + A 2 wl + • • • + A*wf 

Now suppose that the stationary point x s is within the region of experimentation; fur¬ 
thermore, let one or more of the A, be very small (e.g.. A, — 0). The response variable is 
then very insensitive to the variables w, multiplied by the small A, . 

A contour plot illustrating this situation is shown in Figure 11-12 (page 448) for 
k = 2 variables with A, = 0. (In practice, A, would be close to but not exactly equal 
to zero.) The canonical model for this response surface is theoretically 

y = % + a 2 w| 
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Figure 11-13 A contour plot of a rising ridge system. 


with A 2 negative. Notice that the severe elongation in the w, direction has resulted in a 
line of centers at y = 70 and the optimum may be taken anywhere along that line. This 
type of response surface is called a stationary ridge system. 

If the stationary point is far outside the region of exploration for fitting the second- 
order model and one (or more) A, is near zero, then the surface may be a rising ridge. 
Figure 11-13 illustrates a rising ridge for k = 2 variables with A! near zero and A 2 
negative. In this type of ridge system, we cannot draw inferences about the true surface 
or the stationary point because x s is outside the region where we have fit the model. 
However, further exploration is warranted in the w, direction. If A 2 had been positive, 
we would call this system a falling ridge. 


11 -3.4 Multiple Responses 

Many response surface problems involve the analysis of several responses. For instance, 
in Example 11-2, three responses were measured by the experimenter. In that example, 
we optimized the process with respect to only the yield response y t . 

Simultaneous consideration of multiple responses involves first building an appro¬ 
priate response surface model for each response and then trying to find a set of operating 
conditions that in some sense optimizes all responses or at least keeps them in desired 
ranges. An extensive treatment of the multiple response problem is given in Myers and 
Montgomery (1995). 

We may obtain models for the viscosity and molecular weight responses (y 2 and y 3 , 
respectively) in Example 11-2 as follows: 

y 2 = 70.00 - 0.16x, - 0.95x 2 - 0.69.x? - 6.69x1 - l.25x { x 2 
y 3 = 3386.2 + 205. lx, + 17.4x 2 

In terms of the natural levels of time (£,) and temperature (£,), these models are 
y 2 = -9030.74 + 13.393& + 97.708& 

- 2.75 X 10“ 2 £? - 0.26757£| - 5 X 10“ 2 ^fe 

and 


y 3 = -6308.8 + 41.025& + 35.473& 
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Figures 11-14 and 11-15 (page 450) present the contour and response surface plots for 
these models. 

A relatively straightforward approach to optimizing several responses that works 
well when there are only a few process variables is to overlay the contour plots for 
each response. Figure 11-16 (page 451) shows an overlay plot for the three responses in 
Example 11-2, with contours for which y t (yield) 78.5, 62 y 2 (viscosity) =s 68, and 
y 3 (molecular weight Mn) 3400. If these boundaries represent important conditions 
that must be met by the process, then as the unshaded portion of Figure 11-16 shows, 
there are a number of combinations of time and temperature that will result in a satis- 



167.9 77.93 


(6) The response surface plot 

Figure 11-14 Contour plot and response surface plot of viscosity. 
Example 11-2. 
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(a) The contour plot 



Figure 11-15 Contour plot and response surface plot of molecular 
weight, Example 11-2. 


factory process. The experimenter can visually examine the contour plot to determine 
appropriate operating conditions. For example, it is likely that the experimenter would 
be most interested in the larger of the two feasible operating regions shown in Figure 
11-16. 

When there are more than three design variables, overlaying contour plots becomes 
awkward, because the contour plot is two-dimensional, and k - 2 of the design variables 
must be held constant to construct the graph. Often a lot of trial and error is required to 
determine which factors to hold constant and what levels to select to obtain the best view 
of the surface. Therefore, there is practical interest in more formal optimization methods 
for multiple responses. 



11-3 ANALYSIS OF A SECOND-ORDER RESPONSE SURFACE 451 



Figure 11-16 Region of the optimum found by overlaying yield, vis¬ 
cosity, and molecular weight response surfaces. Example 11-2. 


A popular approach is to formulate and solve the problem as a constrained op¬ 
timization problem. To illustrate using Example 11-2, we might formulate the prob¬ 
lem as 


Max ^ 
subject to 
62 =£ y 2 68 
;y 3 3400 

There are many numerical techniques that can be used to solve this problem. Sometimes 
these techniques are referred to as nonlinear programming methods. The Design- 
Expert software package solves this version of the problem using a direct search pro¬ 
cedure. The two solutions found are 


and 


time = 83.5 temp = 177.1 Si = 79.5 
time = 86.6 temp = 172.25 yi = 79.5 


Notice that the first solution is in the upper (smaller) feasible region of the design space 
(refer to Figure 11-16), whereas the second solution is in the larger region. Both solutions 
are very near to the boundary of the constraints. 

Another useful approach to optimization of multiple responses is to use the simul¬ 
taneous optimization technique popularized by Derringer and Suich (1980). Their pro¬ 
cedure makes use of desirability functions. The general approach is to first convert each 
response y, into an individual desirability function d t that varies over the range 
0 1 


where if the response y, is at its goal or target, then d t = 1, and if the response is outside 
an acceptable region, d, = 0. Then the design variables are chosen to maximize the 
overall desirability 

D = (d 1 -d 2 . dj /m 


where there are m responses. 
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The individual desirability functions are structured as shown in Figure 11-17 on the 
facing page. If the objective or target T for the response y is a maximum value, 


0 y < L 



1, y > T 


( 11 - 11 ) 


when the weight r = 1, the desirability function is linear. Choosing r > 1 places more 
emphasis on being close to the target value, and choosing 0 < r < 1 makes this less 
important. If the target for the response is a minimum value, 



y < T 
T^y^U 
y > U 


( 11 - 12 ) 


The two-sided desirability function shown in Figure 11-17(c) assumes that the target is 
located between the lower (L) and upper (U) limits, and is defined as 


0 y<L 



0 y > U 


The Design-Expert software package was used to solve Example 11-2 using the 
desirability function approach. We chose T = 80 as the target for the yield response, 
L = 70, and set the weight for this individual desirability equal to unity. We set T = 65 
for the viscosity response with L = 62 and U = 68 (to be consistent with specifications), 
with both weights n = r 2 = 1. Finally, we indicated that any molecular weight between 
3200 and 3400 was acceptable. Two solutions were found. 


Solution 1: 


Solution 2: 


Time = 86.5 

Temp = 170.5 

D = 0.822 

Si = 78.8 

y 2 = 65 

y 3 = 3287 

Time = 82 

Temp = 178.8 

D = 0.792 

Si = 78.5 

$2 = 65 

s 3 = 3400 


Solution 1 has the highest overall desirability. Notice that it results in on-target viscosity 
and acceptable molecular weight. This solution is in the larger of the two operating regions 
in Figure 11-16, whereas the second solution is in the smaller region. Figure 11-18 (page 
454) shows a response and contour plot of the overall desirability function D. 
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11 -4 EXPERIMENTAL DESIGNS FOR FITTING 
RESPONSE SURFACES 

Fitting and analyzing response surfaces is greatly facilitated by the proper choice of an 
experimental design. In this section, we discuss some aspects of selecting appropriate 
designs for fitting response surfaces. 

When selecting a response surface design, some of the features of a desirable design 
are as follows: 

1. Provides a reasonable distribution of data points (and hence information) 
throughout the region of interest 

2. Allows model adequacy, including lack of fit, to be investigated 

3. Allows experiments to be performed in blocks 

4. Allows designs of higher order to be built up sequentially 

5. Provides an internal estimate of error 

6. Provides precise estimates of the model coefficients 

7. Provides a good profile of the prediction variance throughout the experimental 
region 

8. Provides reasonable robustness against outliers or missing values 

9. Does not require a large number of runs 

10. Does not require too many levels of the independent variables 

11. Ensures simplicity of calculation of the model parameters 

These features are sometimes conflicting, so judgment must often be applied in design 
selection. For more information on the choice of a response surface design, refer to Myers 
and Montgomery (1995), Box and Draper (1987), and Khuri and Cornell (1996). 


11-4.1 Designs for Fitting the First-Order Model 

Suppose we wish to fit the first-order model in k variables 

y = Po + i to, + € (11-14) 

There is a unique class of designs that minimize the variance of the regression coefficients 
{j8, }. These are the orthogonal first-order designs. A first-order design is orthogonal if 
the off-diagonal elements of the (X'X) matrix are all zero. This implies that the cross- 
products of the columns of the X matrix sum to zero. 

The class of orthogonal first-order designs includes the 2 k factorial and fractions of 
the 2* series in which main effects are not aliased with each other. In using these designs, 
we assume that the low and high levels of the k factors are coded to the usual ± 1 levels. 

The 2 k design does not afford an estimate of the experimental error unless some runs 
are replicated. A common method of including replication in the 2 k design is to augment 
the design with several observations at the center (the point jc, = 0, i = 1,2 ,..., k). The 
addition of center points to the 2 k design does not influence the {/3,} for i 2* 1, but the 
estimate of j8 0 becomes the grand average of all observations. Furthermore, the addition 
of center points does not alter the orthogonality property of the design. Example 11-1 
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Figure 11-19 The simplex design for 
(a) k = 2 variables and (b) k = 3 variables. 


illustrates the use of a 2 2 design augmented with five center points to fit a first-order 
model. 

Another orthogonal first-order design is the simplex. The simplex is a regularly sided 
figure with k + 1 vertices in k dimensions. Thus, for k = 2 the simplex design is an 
equilateral triangle and for k = 3 it is a regular tetrahedron. Simplex designs in two and 
three dimensions are shown in Figure 11-19. 


11-4.2 Designs for Fitting the Second-Order Model 

We have informally introduced in Example 11-2 (and even earlier, in Example 6-6) the 
central composite design or CCD for fitting a second-order model. This is the most 
popular class of designs used for fitting these models. Generally, the CCD consists of a 2 k 
factorial (or fractional factorial of resolution V) with n F runs, 2k axial or star runs, and n c 
center runs. Figure 11-20 (on the facing page) shows the CCD for k = 2 and k = 3 factors. 

The practical deployment of a CCD often arises through sequential experimenta¬ 
tion, as in Examples 11-1 and 11-2. That is, a 2 k has been used to fit a first-order model, 
this model has exhibited lack of fit, and the axial runs are then added to allow the 
quadratic terms to be incorporated into the model. The CCD is a very efficient design 
for fitting the second-order model. There are two parameters in the design that must be 
specified: the distance a of the axial runs from the design center and the number of center 
points n c . We now discuss the choice of these two parameters. 
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Rotatability 

It is important for the second-order model to provide good predictions throughout the 
region of interest. One way to define “good” is to require that the model have a rea¬ 
sonably consistent and stable variance of the predicted response at points of interest x. 
Recall Equation 10-40 that the variance of the predicted response at some point x is 

TOO] = oV(X'X)->x 


Box and Hunter (1957) suggested that a second-order response surface design should be 
rotatable. This means that the V[3>(x)] is the same at all points x that are the same distance 
from the design center. That is, the variance of predicted res ponse is constant on spheres. 

Figure 11-21 (page 458) shows contours of constant VV[y(x)] for the second-order 
model fit using the CCD in Example 11-2. Notice that the contours of constant standard 
deviation of predicted response are concentric circles. A design with this property will 
leave the variance of $ unchanged when the design is rotated about the center (0, 0,..., 
0), hence, the name rotatable design. 

Rotatability is a reasonable basis for the selection of a response surface design. 
Because the purpose of RSM is optimization and the location of the optimum is unknown 
prior to running the experiment, it makes sense to use a design that provides equal 
precision of estimation in all directions (it can be shown that any first-order orthogonal 
design is rotatable). 

A central composite design is made rotatable by the choice of a. The value of a for 
rotatability depends on the number of points in the factorial portion of the design; in 
fact, a = (« f ) l/4 yields a rotatable central composite design where n F is the number of 
points used in the factorial portion of the design. 


The Spherical CCD 

Rotatability is a spherical property; that is, it makes the most sense as a design criterion 
when the region of interest is a sphere. However, it is not important to have exact rotat¬ 
ability to have a good design. In fact, for a spherical region of interest, the best choice 
of a from a prediction variance viewpoint for the CCD is to set a = V/fc. This design, 
called a spherical CCD, puts all the factorial and axial design points on the surface of 
a sphere of radius \fk. For more discussion of this, see Myers and Montgomery (1995). 
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(a) Contours of Vv[J<x)] 



Figure 11-21 Contours of constant standard deviation of predicted re¬ 
sponse for the rotatable CCD, Example 11-2. 


Center Runs in the CCD 

The choice of a in the CCD is dictated primarily by the region of interest. When this 
region is a sphere, the design must include center runs to provide reasonably stable 
variance of predicted response. Generally, three to five center runs are recommended. 

The Box-Behnken Design 

Box and Behnken (1960) have proposed some three-level designs for fitting response 
surfaces. These designs are formed by combining 2* factorials with incomplete block 
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Table 11-8 A Three-Variable 

Box-Behnken Design 


2 

3 

4 

5 

6 


8 1 

9 0 

10 0 

11 0 

12 0 

13 0 

14 0 

15 0 


*2 


0 

0 

0 

0 


0 

0 

0 


*3 

0 

0 

0 

0 


0 

0 

0 


designs. The resulting designs are usually very efficient in terms of the number of required 
runs, and they are either rotatable or nearly rotatable. 

Table 11-8 shows a three-variable Box-Behnken design. The design is also shown 
geometrically in Figure 11-22. Notice that the Box-Behnken design is a spherical design, 
with all points lying on a sphere of radius V2. Also, the Box-Behnken design does not 
contain any points at the vertices of the cubic region created by the upper and lower 
limits for each variable. This could be advantageous when the points on the comers of 
the cube represent factor-level combinations that are prohibitively expensive or impos¬ 
sible to test because of physical process constraints. 

Cuboidal Region of Interest 

There are many situations where the region of interest is cuboidal rather than spherical. 
In these cases, a useful variation of the central composite design is the face-centered 
central composite design or the face-centered cube, in which a = 1. This design locates 
the star or axial points on the centers of the faces of the cube, as shown in Figure 11-23 
for k = 3. This variation of the central composite design is also sometimes used because 
it requires only three levels of each factor, and in practice it is frequently difficult to 



Figure 11-22 A Box-Behnken design for three 
factors. 



Figure 11-23 A face-centered central composite design 
for Jfc = 3. 









11-4 EXPERIMENTAL DESIGNS FOR FITTING RESPONSE SURFACES 


461 




Figure 11-25 Equiradial designs for two variables, (a) Hexagon, (b) Pentagon. 


change factor levels. However, note that face-centered central composite designs are not 
rotatable. 

The face-centered cube does not require as many center points as the spherical CCD. 
In practice, n c = 2 or 3 is sufficient to provide good variance of prediction throughout 
the experimental region. It should be noted that sometimes more center runs will be 
employed to give a reasonable estimate of experiment al error. Figure 11-24 (on the facing 
page) shows the square root of prediction variance VV[^(x)] for the face-centered cube 
for k = 3 with n c = 3 center points (x 3 = 0). Notice that the standard deviation of 
predicted response is reasonably uniform over a relatively large portion of the design 
space. 

Other Designs 

There are many other response surface designs that are occasionally useful in practice. 
For two variables, we could use designs consisting of points that are equally spaced on 
a circle and that form regular polygons. Because die design points are equidistant from 
the origin, these arrangements are often called equiradial designs. 

For k = 2, a rotatable equiradial design is obtained by combining n 2 3 s 5 points 
equally spaced on a circle with «, 3* 1 points at the center of the circle. Particularly 
useful designs for k = 2 are the pentagon and hexagon. These designs are shown in 
Figure 11-25. Other useful designs include the small composite design, consisting of a 
fractional factorial in the cube of resolution HI* (main effects aliased with two-factor 
interactions and no two-factor interactions aliased with each other) and the usual axial 
and center runs, and the class of hybrid designs. Both of these designs may be of 
considerable value when it is important to reduce the number of runs as much as possible. 

A small composite design for k = 3 factors is shown in Table 11-9 (on the next 
page). This design uses the standard one-half fraction of the 2 3 in the cube because it 
meets the resolution III* criteria. The design has four runs in the cube and six axial runs, 
and it must have at least one center point. Thus the design has a minimum of N = 11 
trials, and the second-order model in k = 3 variables has p = 10 parameters to estimate, 
so this is a very efficient design with respect to the number of runs. The design in Table 
11-9 has n c — 4 center runs. We selected a = 1.73 to give a spherical design because 
the small composite design cannot be made rotatable. 

A hybrid design for k = 3 is shown in Table 11-10 on the next page. Some of these 
designs have irregular levels, and this can be a limiting factor in their application. How- 
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Table 11-9 A Small Composite 

Design for k = 3 Factors 


Standard 

Order__*2_^L 


2 

3 

4 

5 

6 

7 

8 
9 

10 

11 

12 

13 

14 


1.00 1.00 -1.00 

1.00 -1.00 1.00 

-1.00 1.00 1.00 

-1.00 -1.00 -1.00 

-1.73 0.00 0.00 

1.73 0.00 0.00 

0.00 -1.73 0.00 

0.00 1.73 0.00 

0.00 0.00 -1.73 

0.00 0.00 1.73 

0.00 0.00 0.00 

0.00 0.00 0.00 

0.00 0.00 0.00 

0.00 0.00 0.00 


ever, they are very small designs, and they have excellent prediction variance properties. 
For more details about small composite designs and hybrid designs, refer to Myers and 
Montgomery (1995). 


11-4.3 Blocking in Response Surface Designs 

When using response surface designs, it is often necessary to consider blocking to elim¬ 
inate nuisance variables. For example, this problem may occur when a second-order 
design is assembled sequentially from a first-order design, as was illustrated in Examples 
11-1 and 11-2. Considerable time may elapse between the running of the first-order 


Table 11-10 A Hybrid Design 
for Ic = 3 Factors 


Standard 
Order x, 

1 0.00 

2 0.00 

3 -1.00 

4 1.00 

5 -1.00 

6 1.00 

7 1.41 

8 -1.41 

9 0.00 
0.00 
0.00 


*2 *3 

0.00 1.41 

0.00 -1.41 

-1.00 0.71 

-1.00 0.71 

1.00 0.71 

1.00 0.71 

0.00 -0.71 

0.00 -0.71 

1.41 -0.71 

-1.41 -0.71 

0.00 0.00 


10 
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design and the running of the supplemental experiments required to build up a second- 
order design, and during this time test conditions may change, thus necessitating 
blocking. 

A response surface design is said to block orthogonally if it is divided into blocks 
such that block effects do not affect the parameter estimates of the response surface 
model. If a 2 k or 2 k ~ p design is used as a first-order response surface design, the methods 
of Chapter 7 may be used to arrange the runs in 2 r blocks. The center points in these 
designs should be allocated equally among the blocks. 

For a second-order design to block orthogonally, two conditions must be satisfied. 
If there are n h observations in the bth block, then these conditions are 

1. Each block must be a first-order orthogonal design; that is, 

2 x iu x JU = 0 i * j = 0, 1,..., k for all b 

where x iu and x ju are the levels of ith and j'th variables in the Mth run of the 
experiment with x 0u = 1 for all u. 

2. The fraction of the total sum of squares for each variable contributed by every 
block must be equal to the fraction of the total observations that occur in the 
block; that is, 


2 4 , 

fr- 

where N is the number of runs in the design. 


As an example of applying these conditions, consider a rotatable central composite 
design in k — 2 variables with N = \2 runs. We may write the levels of x x and x 2 for 
this design in the design matrix 


*i * 2 

-1 -1 

1 -1 

-1 1 

1 1 

0 0 

0 0 

1.414 0 

-1.414 0 

0 1.414 

0 -1.414 

0 0 

0 0 


Block 1 


Block 2 


Notice that the design has been arranged in two blocks, with the first block consisting 
of the factorial portion of the design plus two center points and the second block con- 
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sisting of the axial points plus two additional center points. It is clear that condition 1 is 
met; that is, both blocks are first-order orthogonal designs. To investigate condition two, 
consider first block 1 and noti 

2 *?. 

i*. 

Therefore, 


4 _ 6 

8 “ 12 

Thus, condition 2 is satisfied in block 1. For block 2, we have 

and n 2 = 6 


«2 
N 

6_ 

12 

Because condition 2 is also satisfied in block 2, this design blocks orthogonally. 

In general, the central composite design can always be constructed to block orthog¬ 
onally in two blocks with the first block consisting of n F factorial points plus n CF center 
points and the second block consisting of n A = 2k axial points plus n CA center points. 
The first condition for orthogonal blocking will always hold regardless of the value used ** 
for a in the design. For the second condition to hold, 


E *1 U = E X 2u = 4 

Therefore, 

JU 

E xL 

4 

8 : 


:e that 

= E *L = 4 

= E x 2 u = 8 and «, = 6 


E xfu 
ixi 


n 2 

«1 


n A + n CA 

n F + n CF 


(11-15) 


The left-hand side of Equation 11-15 is 2 o?ln F , and after substituting in this quantity, 
we may solve the equation for the value of a that will result in orthogonal blocking as 


a = 


n F (n A + n CA ) 
2(71/7 + n CF ) 


(11-16) 
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This value of a does not, in general, result in a rotatable or spherical design. If the 
design is also required to be rotatable, then a = (n F ) m and 


xi /2 _ + n CA ) 

M - 2( V + n CF ) 


(11-17) 


It is not always possible to find a design that exactly satisfies Equation 1 1 - 17 . For example 
if k = 3, n F = 8 and n A = 6, and Equation 11-17 reduces to 


( 8) 1/2 = 


8(6 + n CA ) 
2(8 + n CF ) 
48 + 8 n CA 
16 + 2 n CF 


It is impossible to find values of n CA and n CF that exactly satisfy this last equation. 
However, note that if n CF = 3 and n CA = 2, then the right-hand side is 


48 + 8(2) 
16 + 2(3) 


= 2.91 


so the design nearly blocks orthogonally. In practice, one could relax somewhat the 
requirement of either rotatability or orthogonal blocking without any major loss of 
information. 

The central composite design is very versatile in its ability to accommodate blocking. 
If k is large enough, the factorial portion of the design can be divided into two or more 
blocks. (The number of factorial blocks must be a power of 2, with the axial portion 
forming a single block.) Table 11-11 presents several useful blocking arrangements for 
the central composite design. 


Table 11-11 Some Rotatable and Near-Rotatable Central Composite Designs That Block Orthogonally 


k 

2 

3 

4 

5 

5 

j Rep. 

6 

6 

5 Re P - 

7 

7 

5 Rep. 

Factorial Block(s) 

n F 

4 

8 

16 

32 

16 

64 

32 

128 

64 

Number of blocks 

1 

2 

2 

4 

1 

8 

2 

16 

8 

Number of points in 

4 

4 

8 

8 

16 

8 

16 

8 

8 

each block 

Number of center 

3 

2 

2 

2 

6 

1 

4 

j 

1 

points in each block 
Total number of points 

7 

6 

10 

10 

22 

9 

20 

9 

9 

in each block 

Axial Block 

4 

6 

8 

10 

10 

12 

12 

14 

14 

»CA 

3 

2 

2 

4 

1 

6 

2 

11 

4 

Total number of points 

7 

8 

10 

14 

11 

18 

14 

25 

18 

in the axial block 

Total number of points 

14 

20 

30 

54 

33 

90 

54 

169 

80 

N in the design 

Values of a 

Orthogonal blocking 

1.4142 

1.6330 

2.0000 

2.3664 

2.0000 

2.8284 

2.3664 

3.3636 

2.8284 

Rotatability 

1.4142 

1.6818 

2.0000 

2.3784 

2.0000 

2.8284 

2.3784 

3.3333 

2.8284 
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There are two important points about the analysis of variance when the response 
surface design has been run in blocks. The first concerns the use of center points to 
calculate an estimate of pure error. Only center points that are run in the same block can 
be considered to be replicates, so the pure error term can only be calculated within each 
block. If the variability is consistent across blocks, then these pure error estimates could 
be pooled. The second point concerns the block effect. If the design blocks orthogonally 
in m blocks, the sum of squares for blocks is 

SS B i ocks = i - - % (11-18) 

6=i n„ N 

where B b is the total of the n h observations in the bth block and G is the grand total of 
all N observations in all m blocks. When blocks are not exactly orthogonal, the general 
regression significance test (the “extra sum of squares” method) described in Chapter 
10 can be used. 


11'4.4 Computer-Generated (Optimal) Designs 

The standard response surface designs discussed in the previous sections, such as the 
central composite design, the Box-Behnken design, and their variations (such as the 
face-centered cube) are widely used because they are quite general and flexible designs. 
If the experimental region is either a cube or a sphere, there is typically a standard 
response surface design that will be applicable to die problem. However, occasionally 
an experimenter encounters a situation where a standard response surface design may 
not be the obvious choice. Computer-generated designs are an alternative to consider 
in these cases. 

There are three situations where some type of computer-generated design may be 
appropriate. 

1. An irregular experimental region. If the region of interest for the experiment is not 
a cube or a sphere, standard designs may not be the best choice. Irregular regions of 
interest occur fairly often. For example, an experimenter is investigating the properties 
of a particular adhesive. The adhesive is applied to two parts and then cured at an elevated 
temperature. The two factors of interest are the amount of adhesive applied and the cure 
temperature. Over the ranges of these two factors, taken as — 1 to +1 on the usual coded 
variable scale, the experimenter knows that if too little adhesive is applied and the cure 
temperature is too low, the parts will not bond satisfactorily. In terms of the coded 
variables, this leads to a constraint on the design variables, say 
— 1.5 x { + x 2 

where x 1 represents the application amount of adhesive and x 2 represents the temperature. 
Furthermore, if the temperature is too high and too much adhesive is applied, the parts 
will be either damaged by heat stress or an inadequate bond will result. Thus, there is 
another constraint on the factor levels 

*i + x 2 =£ 1 

Figure 11-26 (on the next page) shows the experimental region that results from applying 
these constraints. Notice that the constraints effectively remove two comers of the square, 
producing an irregular experimental region (sometimes these irregular regions are called 
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Figure 11-26 A constrained design region in two variables. 


“dented cans”). There is no standard response surface design that will fit exactly into 
this region. 

2. A nonstandard model. Usually an experimenter elects a first- or second-order re¬ 
sponse surface model, realizing that this empirical model is an approximation to the true 
underlying mechanism. However, sometimes the experimenter may have some special 
knowledge or insight about the process being studied that may suggest a nonstandard 
model. For example, the model 

y = Po + P\Xi + 132X2 + P 12 xix 2 + p u x 2 i + P 22 X 2 

+ Pmxfa + j8m 2 *l*2 + 6 

may be of interest. The experimenter would be interested in obtaining an efficient design 
for fitting this reduced quartic model. As another illustration, sometimes we encounter 
response surface problems where some of the design factors are categorical variables. 
There are no standard response surface designs for this situation [refer to Myers and 
Montgomery (1995) for a discussion of categorical variables in response surface 
problems]. 

3. Unusual sample size requirements. Occasionally an experimenter may need to re¬ 
duce the number of runs required by a standard response surface design. For example, 
suppose we intend to fit a second-order model in four variables. The central composite 
design for this situation requires between 28 and 30 runs, depending on the number of 
center points selected. However, the model has only 15 terms. If the runs are extremely 
expensive or time-consuming, the experimenter will want a design with fewer trials. 
Although computer-generated designs can be used for this purpose, there are usually 
better approaches. For example, a small composite design can be constructed for four 
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factors with 20 runs, including four center points, and a hybrid design with as few as 16 
runs is also available. These are generally superior choices to using a computer-generated 
design to reduce the number of trials. 

Much of the development of computer-generated designs is an outgrowth of work 
by Kiefer (1959, 1961) and Kiefer and Wolfowitz (1959) in the theory of optimal de¬ 
signs. By an optimal design, we mean a design that is “best” with respect to some 
criterion. Computer programs are required to construct these designs. The usual approach 
is to specify a model, determine the region of interest, select the number of runs to make, 
specify the optimality criterion, and then choose the design points from a set of candidate 
points that the experimenter would consider using. Typically, the candidate points are a 
grid of points spaced over the feasible design region. 

There are several popular design optimality criteria. Perhaps the most widely used 
is the D-optimality criterion. A design is said to be D-optimal if 

Kx'xrM 

is minimized. It turns out that a D-optimal design minimizes the volume of the joint 
confidence region on the vector of regression coefficients. A measure of the relative 
efficiency of design 1 to design 2 according to the D-criterion is given by 



where Xi and X 2 are the X matrices for the two designs and p is the number of model 
parameters. 

The A-optimality criterion deals with only the variances of the regression coeffi¬ 
cients. A design is A-optimal if it minimizes the sum of the main diagonal elements of 
(X'X ) -1 (this is called the trace of (X'X) -1 , usually denoted tr(X'X) -1 ). Thus an A- 
optimal design minimizes the sum of the variances of the regression coefficients. 

Because many response surface experiments are concerned with prediction of the 
response, prediction variance criteria are of considerable practical interest. Perhaps the 
most popular of these is the G-optimality criterion. A design is said to be G-optimal if 
it minimizes the maximum scaled prediction variance over the design region. That is, if 
the maximum value of 

NV[y 00 ] 

a 2 

over the design region is a minimum, where N is the number of points in the design. If 
the model has p parameters, the G-efficiency of a design is just 


G e 


P 

iW[y(x)l 

max- 5 — 

cr 


( 11 - 20 ) 


The V-criterion considers the prediction variance at a set of points of interest in the 
design region, say Xi, x 2 ,..., x m . The set of points could be the candidate set from which 
the design was selected, or it could be some other collection of points that have specific 
meaning to the experimenter. A design that minimizes the average prediction variance 
over this set of m points is a V-optimal design. 

Collectively, the design criteria that we have been discussing are often called al¬ 
phabetic optimality criteria. There are some situations for which the alphabetically 
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optimal design is either known or can be constructed analytically. A good example is 
the 2* design, which is D-, A-, G-, and V-optimal for fitting the first-order model in k 
variables or for fitting the first-order model with interaction. However, in most cases, the 
optimal design is not known and a computer-based algorithm must be employed to find 
a design. Many statistical software packages that support designed experiments have this 
capability. Most of the design construction procedures are based on the exchange al¬ 
gorithm. Essentially, a grid of candidate points is selected by the experimenter and an 
initial design is selected (perhaps by random) from this set of points. Then the algorithm 
exchanges points that are in the grid but not in the design with points currently in the 
design in an effort to improve the selected optimality criterion. Because not every pos¬ 
sible design is explicitly evaluated, there is no guarantee that an optimal design has been 
found, but the exchange procedure usually ensures that a design that is ‘ ‘close’ ’ to optimal 
results. Some implementations repeat the design construction process several times, start¬ 
ing from different initial designs, to increase the likelihood that a final design that is very 
near the optimal will result. 

To illustrate some of these ideas, consider the adhesive experiment discussed pre¬ 
viously that led to the irregular experimental region in Figure 11-26. Suppose that the 
response of interest is pull-off force and that we wish to fit a second-order model to this 
response. In Figure 11-27a (page 470) we show a central composite design with four 
center points (12 runs total) inscribed inside this region. This is not a rotatable design, 
but it is the largest CCD that we can fit inside the design space. For this design | (X'X) -1 | 
= 1.852 E-2, and the trace of (X'X) -1 is 6.375. Also shown in Figure 11-27a are the 
contours of constant standard deviation of the predicted response, calculated assuming 
that a = 1. Figure 11-276 shows the corresponding response surface plot. 

Figure ll-28a (page 471) and Table 11-12 (page 472) show a 12-run D-optimal 
design for this problem, generated with the Design-Expert software package. For this 
design, | (X'X) -1 1 = 2.153 E-4. Notice that the D-criterion is considerably better for this 
design than for the inscribed CCD. The relative efficiency of the inscribed CCD to the 
D-optimal design is 



That is, the inscribed CCD is only 47.6 percent as efficient as the D-optimal design. This 
implies that the CCD would have to be replicated 1/0.476 = 2.1 times (or approximately 
twice) to have the same precision of estimation for the regression coefficients as achieved 
with the D-optimal design. The trace of (X'X) -1 is 2.516 for the D-optimal design, 
indicating that the sum of the variances of the regression coefficients are considerably 
smaller for this design than for the CCD. Figure 11-28a and 6 also shows the contours 
of constant standard deviation of predicted response and the associated response surface 
plot (assuming that cr = 1). Generally, the prediction standard deviation contours are 
lower for the D-optimal design than for the inscribed CCD, particularly near the bound¬ 
aries of the region of interest where the inscribed CCD does not have any design points. 

Figure 11-29a (page 473) shows a third design, created by taking the two replicates 
at the comers of the region in the D-optimal design and moving them to the design 
center. This could be a useful idea, because Figure 11-286 shows that the standard de¬ 
viation of predicted response increases slightly near the center of the design region for 
the D-optimal design. Figure 11-29a also shows the contours of constant standard de- 
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Table 11-12 A D-Optimal Design 
for the Constrained 
Region in Figure 11-26 

Standard 

Order __ x 2 

1 -0.50 -1.00 

2 1.00 0.00 

3 -0.08 -0.08 

4 -1.00 1.00 

5 1.00 -1.00 

6 0.00 1.00 

7 -1.00 0.25 

8 0.25 -1.00 

9 -1.00 -0.50 

10 1.00 0.00 

11 0.00 1.00 

12 -0.08 -0.08 


viation of prediction for this modified D-optimal design, and Figure 11-296 shows the 
response surface plot. The D-criterion for this design is | (X'X) -1 1 = 3.71 E-4, and the 
relative efficiency is 



That is, this design is almost as efficient as the D-optimal design. The trace of (X'X) -1 
is 2.448 for this design, a slightly larger value than was obtained for the D-optimal design. 
The contours of constant prediction standard deviation for this design visually look at 
least as good as those for the D-optimal design, particularly in the center of the region. 

Computer-generated designs based on alphabetic optimality criteria can certainly be 
useful in situations where the experimental region is not either spherical or cuboidal. 
However, they are not replacements for standard designs in most problems. Alphabeti¬ 
cally optimal designs are generated by strict adherence to one criterion only, and as we 
noted at the start of Section 11-4, where several different design criteria were listed, 
including several that are somewhat qualitative or subjective in nature. In real experi¬ 
mental problems, there are usually many criteria that need to be evaluated in selecting 
a design. For more discussion on this issue, refer to Myers and Montgomery (1995, 
Chapter 8). 


11 -5 MIXTURE EXPERIMENTS 


In previous sections, we have presented response surface designs for those situations in 
which the levels of each factor are independent of the levels of other factors. In mixture 
experiments, the factors are the components or ingredients of a mixture, and conse¬ 
quently, their levels are not independent. For example, if x u x 2 ,.. ■, x p denote the pro¬ 
portions of p components of a mixture, then 

0 « jc, *£ 1 i = 1,2,..., p 


and 


X\ + *2 + • • • + x p = 1 


(i.e., 100 percent) 
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Figure 11-30 Constrained factor space Figure 11-31 Trilinear coordinate system, 
for mixtures with (a) p = 2 components 
and (b)p = 3 components. 


These restrictions are illustrated graphically in Figure 11-30 for p = 2 and p = 3 com¬ 
ponents. For two components, the factor space for the design includes all values of the 
two components that lie on the line segment *i + x 2 = 1, with each component being 
bounded by 0 and 1. With three components, the mixture space is a triangle with vertices 
corresponding to formulations that are pure blends (mixtures that are 100 percent of a 
single component). 

When there are three components of the mixture, the constrained experimental region 
can be conveniently represented on trilinear coordinate paper as shown in Figure 11- 
31. Each of the three sides of the graph in Figure 11-31 represents a mixture that has 
none of one of the three components (the component labeled on the opposite vertex). 
The nine grid lines in each direction mark off 10 percent increments in the respective 
components. 

Simplex designs are used to study the effects of mixture components on the response 
variable. A [p, m} simplex lattice design for p components consists of points defined 
by the following coordinate settings: the proportions assumed by each component take 
the m + 1 equally spaced values from 0 to 1, 

jc,- = 0,^,^,..., i i=l, 2,...,p (11-21) 

and all possible combinations (mixtures) of the proportions from Equation 11-21 are 
used. As an example, let p = 3 and m = 2. Then 

x, = 0, i 1 i= 1, 2, 3 
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Figure 11-32 Some simplex lattice designs for p = 3 and p = 4 components. 


and the simplex lattice consists of the following six runs: 

(*i, *2, X 3 ) = (1, 0, 0), (0, 1, 0), (0, 0, 1), (i i 0), (i 0, ±), (0, i \) 

This design is shown in Figure 11-32. The three vertices (1,0, 0), (0, 1, 0), and (0, 0, 1) 
are the pure blends, whereas the points (5, 5, 0 ), (5, 0 , 5), and ( 0 , 5, 5) are binary blends 
or two-component mixtures located at the midpoints of the three sides of the triangle. 
Figure 11-32 also shows the {3,3}, {4,2}, and {4,3} simplex lattice designs. In general, 
the number of points in a {p, m] simplex lattice design is 

= (p + m - 1)! 

m!(p - 1 )! 

An alternative to the simplex lattice design is the simplex centroid design. In a p- 
component simplex centroid design, there are 2 P — 1 points, corresponding to the p 
permutations of ( 1 , 0 , 0 ,. .., 0 ), the ({j) permutations of (5, 5, 0 ,... , 0 ), the (?) permu¬ 
tations of ( 5 , j, 5 , 0,.... 0),..., and the overall centroid ^). Figure 11-33 

shows some simplex centroid designs. 

A criticism of the simplex designs described above is that most of the experimental 




Figure 11-33 Simplex centroid designs with (a) p = 3 components and (b) p = 4 components. 
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runs occur on the boundary of the region and, consequently, include only p - 1 of the 
p components. It is usually desirable to augment the simplex lattice or simplex centroid 
with additional points in the interior of the region where the blends will consist of all p 
mixture components. For more discussion, see Cornell (1990) and Myers and Montgom¬ 
ery (1995). 

Mixture models differ from the usual polynomials employed in response surface 


work because of the constraint = 1. The standard forms of the mixture models that 

are in widespread use are 

Linear: 

E(y) = t ft*, 

(11-22) 

Quadratic: 

E(y) = 2 ft*. + E E ft/'*.'*/' 

(11-23) 

Full Cubic: 

E(y) = 2 ft*. + E E ft,*.*/ 

+ 2 2 v.*A ~ */) 

i<j 

+ 22X PijkXiXjX k 
i<j<k 

(11-24) 

Special Cubic: 

E(y) = 2 ft*. + S 2 fty*.*y 

1=1 i<j 

+ ESE PijkXiXjX k 

(11-25) 


The terms in these models have relatively simple interpretations. In Equations 11- 
22 through 11-25, the parameter )3, represents the expected response to the pure blend 
x, = 1 and Xj = 0 when j + i. The portion 2? = 1 /3,*,- is called the linear blending portion. 
When there is curvature arising from nonlinear blending between component pairs, the 
parameters fa represent either synergistic or antagonistic blending. Higher-order terms 
are frequently necessary in mixture models because ( 1 ) the phenomena studied may be 
complex and ( 2 ) the experimental region is frequently the entire operability region and 
is therefore large, requiring an elaborate model. 

EXAMPLE 11-3 . 

A Three-Component Mixture 

Cornell (1990) describes a mixture experiment in which three components—polyethyl¬ 
ene (Jr,), polystyrene (x 2 ), and polypropylene (x 3 )—were blended to form fiber that will 
be spun into yam for draperies. The response variable of interest is yam elongation in 
kilograms of force applied. A {3, 2} simplex lattice design is used to study the product. 
The design and the observed responses are shown in Table 11-13 on the next page. Notice 
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Table 11-13 The {3, 2} Simplex Lattice Design for the Yam Elongation Problem 


Design 

Point 

Component Proportions 

X, *2 *3 

Observed 
Elongation Values 

Average 

Elongation Value (y) 

1 

1 

0 

0 

11.0, 12.4 

11.7 

2 

\ 

\ 

0 

15.0, 14.8, 16.1 

15.3 

3 

0 

1 

0 

8.8, 10.0 

9.4 

4 

0 

5 

\ 

10.0, 9.7, 11.8 

10.5 

5 

0 

0 

1 

16.8, 16.0 

16.4 

6 

5 

0 

5 

17.7, 16.4, 16.6 

16.9 


that all of the design points involve either pure or binary blends; that is, at most only 
two of the three components are used in any formulation of the product. Replicate ob¬ 
servations are also run, with two replicates at each of the pure blends and three replicates 
at each of the binary blends. The error standard deviation can be estimated from these 
replicate observations as & = 0.85. Cornell fits the second-order mixture polynomial to 
the data, resulting in 

f = 11.7*! + 9.4*2 + 16.4*3 + I9.0x t x 2 + 11.4 * t * 3 - 9.6 * 2 * 3 

This model can be shown to be an adequate representation of the response. Note that 
because /3 3 > /3i > /3 2 , we would conclude that component 3 (polypropylene) produces 
yam with the highest elongation. Furthermore, because /3 12 and $ I3 are positive, blending 
components 1 and 2 or components 1 and 3 produces higher elongation values than would 
be expected just by averaging the elongations of the pure blends. This is an example of 
“synergistic” blending effects. Components 2 and 3 have antagonistic blending effects 
because /3 23 is negative. 

Figure 11-34 plots the contours of elongation, and this may be helpful in interpreting 
the results. From examining the figure, we note that if maximum elongation is desired, 
a blend of components 1 and 3 should be chosen consisting of about 80 percent com¬ 
ponent 3 and 20 percent component 1. 



Figure 11-34 Contours of constant estimated yam elonga¬ 
tion from the second-order mixture model for Example 11-3. 
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We noted previously that the simplex lattice and simplex centroid designs are bound¬ 
ary point designs. If the experimenter wants to make predictions about the properties 
of complete mixtures, it would be highly desirable to have more runs in the interior of 
the simplex. We recommend augmenting the usual simplex designs with axial runs and 
the overall centroid (if the centroid is not already a design point). 

The axis of component i is the line or ray extending from the base point x ; = 0, 
Xj = \Kp - 1) for all j ¥= i to the opposite vertex where x, = 1, Xj = 0 for ally =£ i. The 
base point will always lie at the centroid of the (p — 2 )-dimensional boundary of the 
simplex that is opposite the vertex x, = 1, x, = 0 for ally =£ i. [The boundary is sometimes 
called a (p - 2)-flat.] The length of the component axis is one unit. Axial points are 
positioned along the component axes a distance A from the centroid. The maximum 
value for A is (p - Dip- We recommend that axial runs be placed midway between the 
centroid of the simplex and each vertex so that A = (p ~ l)/2p. Sometimes these points 
are called axial check blends because a fairly common practice is to exclude them when 
fitting the preliminary mixture model and then use the responses at these axial points to 
check the adequacy of the fit of the preliminary model. 

Figure 11-35 shows the {3,2} simplex lattice design augmented with the axial points. 
This design has 10 points, with four of these points in the interior of the simplex. The 
{ 3 , 3 } simplex lattice will support fitting the full cubic model, whereas the augmented 
simplex lattice will not; however, the augmented simplex lattice will allow the experi¬ 
menter to fit the special cubic model or to add special quartic terms such as 
j 8 i 233 *ix 2 x? to the quadratic model. The augmented simplex lattice is superior for studying 
the response of complete mixtures in the sense that it can detect and model curvature in 
the interior of the triangle that cannot be accounted for by the terms in the full cubic 
model. The augmented simplex lattice has more power for detecting lack of fit than does 
the [3, 3} lattice. This is particularly useful when the experimenter is unsure about the 
proper model to use and also plans to sequentially build a model by starting with a simple 
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polynomial (perhaps first order), test the model for lack of fit, then augment the model 
with higher-order terms, test the new model for lack of fit, and so forth. 

In some mixture problems, constraints on the individual components arise. Lower 
bound constraints of the form 

/, s= x, 1 i = 1, 2,..., p 

are fairly common. When only lower bound constraints are present, the feasible design 
region is still a simplex, but it is inscribed inside the original simplex region. This situ¬ 
ation may be simplified by the introduction of pseudocomponents, defined as 



with Uj = | lj < 1. Now 


x\ + x' 2 + • • • + x'p = 1 


so the use of pseudocomponents allows the use of simplex-type designs when lower 
bounds are a part of the experimental situation. The formulations specified by the simplex 
design for the pseudocomponents are transformed into formulations for the original com¬ 
ponents by reversing the transformation Equation 11-26. That is, if x' is the value as¬ 
signed to the z'th pseudocomponent on one of the runs in the experiment, the z'th original 
mixture component is 


x,‘ = /,■ + (l - 2 Ij^x] (11-27) 

If the components have both upper and lower bound constraints, the feasible region 
is no longer a simplex; instead, it will be an irregular polytope. Because the experimental 
region is not a “standard” shape, computer-generated designs are very useful for these 
types of mixture problems. 


EXAMPLE 11-4 . 

Paint Formulation 

An experimenter is trying to optimize the formulation of automotive clear coat paint. 
These are complex products that have very specific performance requirements. Specifi¬ 
cally, the customer wants the Knoop hardness to exceed 25 and the percentage of solids 
to be below 30. The clear coat is a three component mixture, consisting of a monomer 
(xO, a crosslinker (x 2 ), and a resin (x 3 ). There are constraints on the component 
proportions: 

X\ + x 2 + x 3 = 100 
5 =£ Xi =£ 25 
25 =£ x 2 ^ 40 
50 =£ x 3 =£ 70 

The result is the constrained region of experimentation shown in Figure 11-36 on the 
next page. Because the region of interest is not a simplex, we will use a D-optimal design 
for this problem. Assuming that both responses are likely to be modeled with a quadratic 
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Monomer 

25.00 



Figure 11-36 The constrained experimental region for 
the paint formulation problem in Example 11-4 (shown in 
the actual component scale). 


Table 11-14 A D-Optimal Design for the Paint Formulation Problem 
in Example 11 -4 


Standard Monomer Crosslinker Resin Hardness Solids 

Order Run x, x 2 x 3 y x y 2 


1 2 17.50 

2 1 10.00 

3 4 15.00 

4 13 25.00 

5 ' 7 5.00 

6 3 5.00 

7 6 11.25 

8 11 5.00 

9 10 18.13 

10 14 8.13 

11 12 25.00 

12 9 15.00 

13 5 10.00 

14 8 5.00 


32.50 50.00 

40.00 50.00 

25.00 60.00 

25.00 50.00 

25.00 70.00 

32.50 62.50 

32.50 56.25 

40.00 55.00 

28.75 53.13 

28.75 63.13 

25.00 50.00 

25.00 60.00 

40.00 50.00 

25.00 70.00 


29 9.539 

26 27.33 

17 29.21 

28 30.46 

35 74.98 

31 31.5 

21 15.59 

20 19.2 

29 23.44 

25 32.49 

19 23.01 

14 41.46 

30 32.98 

23 70.95 
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Table 11-15 Model Fitting for the Hardness Response 


Response: hardness 

ANOVA for Mixture Quadratic Model 


Analysis of variance table [Partial 


Sum of 

Source Squares 

Model 279.73 

Linear Mixture 29.13 

AB 72.61 

AC 179.67 

BC 8.26 

Residual 188.63 

Lack of Fit 63.63 

Pure Error 125.00 

Cor Total 468.36 


sum of squares] 

Mean 

DF Square 

5 55.95 

2 14.56 

1 72.61 

1 179.67 

1 8.26 

8 23.58 

4 15.91 

4 31.25 

13 


Std. Dev. 
Mean 
C.V. 
PRESS 


4.86 

24.79 

19.59 

638.60 


R-Squared 
Adj R-Squared 
Pred R-Squared 
Adeq Precision 


Coefficient 


Standard 


Component 

A-Monomer 

B-Crosslinker 

C-Resin 

AB 

AC 

BC 


Estimate 

23.81 

16.40 

29.45 

44.42 

-44.01 

13.80 


Error 

3.36 

7.68 

3.36 

25.31 
15.94 

23.32 


Value Prob > F 

2.37 0.1329 

0.62 0.5630 

3.08 0.1174 

7.62 0.0247 

0.35 0.5703 

0.51 0.7354 


0.5973 

0.3455 

-0.3635 

4.975 

95% Cl 95% Cl 

Low High 

16.07 31.55 

-1.32 34.12 

21.71 37.19 

-13.95 102.80 

-80.78 -7.25 

-39.97 67.57 


Final Equation in Terms of Pseudo Components: 

hardness = 

+23.81 * A 
+ 16.40 * B 
+29.45 * C 
+44.42 * A * B 
-44.01 * A * C 
+ 13.80 * B * C 


mixture model, we can generate the D-optimal design shown in Figure 11-36 using 
Design-Expert. We assumed that in addition to the six runs required to fit the quadratic 
mixture model that four additional distinct runs would be made to check for lack of fit 
and that four of these runs would be replicated to provide an estimate of pure error. 
Design-Expert used the vertices, the edge centers, the overall centroid and the check runs 
(points located halfway between the centroid and the vertices) as the candidate points. 

The 14-run design is shown in Table 11-14 (page 480), along with the hardness and 
solids responses. The results of fitting quadratic models to both responses are summarized 
in Tables 11-15 and 11-16 (pages 481 and 482). Notice that quadratic models fit nicely 
to both the hardness and the solids responses. The fitted equations for both responses (in 
terms of the pseudocomponents) are shown in these tables. Contour plots of the responses 
are shown in Figures 11-37 and 11-38 (pages 482 and 483). 

Figure 11-39 (page 483) is an overlay plot of the two response surfaces, showing 
the Knoop hardness contour of 25 and the 30 percent contour for solids. The feasible 
region for this product is the unshaded region near the center of the plot. Obviously, 



Table 11-16 Model Fitting for the Solids Response 


Response: solids 

ANOVA for Mixture Quadratic Model 


Analysis of variance table [Partial sum of squares] 



Sum of 


Mean 

F 


Source 

Squares 

DF 

Square 

Value 

Prob > F 

Model 

4297.94 

5 

859.59 

25.78 

<0.0001 

Linear Mixture 

2931.09 

2 

1465.66 

43.95 

<0.0001 

AB 

211.20 

1 

211.20 

6.33 

0.0360 

AC 

285.67 

1 

285.67 

8.57 

0.0191 

BC 

1036.72 

1 

1036.72 

31.09 

0.0005 

Residual 

266.79 

8 

33.35 



Lack of Fit 

139.92 

4 

34.98 

1.10 

0.4633 

Pure Error 

126.86 

4 

31.72 



Cor Total 

4564.73 

13 




Std. Dev. 

5.77 


R-Squared 

0.9416 


Mean 

33.01 


Adj R-Squared 

0.9050 


C.V. 

17.49 


Pred R-Squared 

0.7827 


PRESS 

991.86 


Adeq Precision 

15.075 



Coefficient 


Standard 

95%CI 

95% Cl 

Component 

Estimate 

DF 

Error 

Low 

High 

A-Monomer 

26.53 

1 

3.99 

17.32 

35.74 

B-Crosslinker 

46.60 

1 

9.14 

25.53 

67.68 

C-Resin 

73.23 

1 

3.99 

64.02 

82.43 

AB 

-75.76 

1 

30.11 

-145.19 

-6.34 

AC 

-55.50 

1 

18.96 

-99.22 

-11.77 

BC 

-154.61 

1 

27.73 

-218.56 

-90.67 


Final Equation in Terms of Pseudo Components: 

solids = 

+26.53 * A 
+46.60 * B 
+73.23 * C 
-75.76 * A * B 
-55.50 * A * C 
-154.61 * B * C 


Monomer 



Figure 11-37 Contour plot of the Knoop hardness response. Ex¬ 
ample 11-4. 
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Monomer 




Figure 11-39 Overlay plot of the Knoop hardness and percentage 
of solids response, showing the feasible region for the paint 
formulation. 
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there are a number of choices for the proportions of monomer, crosslinker, and resin for 
the clear coat that will give a product satisfying the performance requirements. 


11 -6 EVOLUTIONARY OPERATION 

Response surface methodology is often applied to pilot plant operations by research and 
development personnel. When it is applied to a full-scale production process, it is usually 
only done once (or very infrequently) because the experimental procedure is relatively 
elaborate. However, conditions that were optimum for the pilot plant may not be optimum 
for the full-scale process. The pilot plant may produce 2 pounds of product per day, 
whereas the full-scale process may produce 2000 pounds per day. This “scale-up” of 
the pilot plant to the full-scale production process usually results in distortion of the 
optimum conditions. Even if the full-scale plant begins operation at the optimum, it will 
eventually “drift” away from that point because of variations in raw materials, environ¬ 
mental changes, and operating personnel. 

A method is needed for the continuous monitoring and improvement of a full-scale 
process with the goal of moving the operating conditions toward the optimum or follow¬ 
ing a “drift.” The method should not require large or sudden changes in operating 
conditions that might disrupt production. Evolutionary operation (EVOP) was pro¬ 
posed by Box (1957) as such an operating procedure. It is designed as a method of 
routine plant operation that is carried out by manufacturing personnel with minimum 
assistance from the research and development staff. 

EVOP consists of systematically introducing small changes in the levels of the op¬ 
erating variables under consideration. Usually, a 2 k design is employed to do this. The 
changes in the variables are assumed to be small enough that serious disturbances in 
yield, quality, or quantity will not occur, yet large enough that potential improvements 
in process performance will eventually be discovered. Data are collected on the response 
variables of interest at each point of the 2 k design. When one observation has been taken 
at each design point, a cycle is said to have been completed. The effects and interactions 
of the process variables may then be computed. Eventually, after several cycles, the effect 
of one or more process variables or their interactions may appear to have a significant 
effect on the response. At this point, a decision may be made to change the basic operating 
conditions to improve the response. When improved conditions have been detected, a 
phase is said to have been completed. 

In testing the significance of process variables and interactions, an estimate of ex¬ 
perimental error is required. This is calculated from the cycle data. Also, the 2 k design 
is usually centered about the current best operating conditions. By comparing the re¬ 
sponse at this point with the 2 k points in the factorial portion, we may check on curvature 
or change in mean (CIM); that is, if the process is really centered at the maximum, say, 
then the response at the center should be significantly greater than the responses at the 
2 A -peripheral points. 

In theory, EVOP can be applied to k process variables. In practice, only two or three 
variables are usually considered. We will give an example of the procedure for two 
variables. Box and Draper (1969) give a detailed discussion of the three-variable case, 
including necessary forms and worksheets. Myers and Montgomery (1995) discuss the 
computer implementation of EVOP. 
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Figure 11-40 A 2 2 design for EVOP. 


EXAMPLE 11-5 . 

Consider a chemical process whose yield is a function of temperature (jt,) and pressure 
(x 2 ). The current operating conditions are x t = 250°F and x 2 = 145 psi. The EVOP 
procedure uses the 2 2 design plus the center point shown in Figure 11-40. The cycle is 
completed by running each design point in numerical order (1, 2, 3, 4, 5). The yields in 
the first cycle are also shown in Figure 11-40. 

The yields from the first cycle are entered in the EVOP calculation sheet, shown in 
Table 11-17. At the end of the first cycle, no estimate of the standard deviation can be 


Table 11-17 EVOP Calculation Sheet for Example 11-5, n = 1 


5 0 3 

2i -U 

Cycle: n = 1 
Response: Yield 

Phase: 1 

Date: 1/11/00 


Calculation of Averages 

Calculation of 
Standard 
Deviation 

Operating Conditions 

(i) Previous cycle sum 

(ii) Previous cycle average 

(1) (2) (3) (4) 

(5) 

Previous sum S = 
Previous average 

(iii) New observations 

84.5 84.2 84.9 84.5 

84.3 New S = range X 

(iv) Differences [(ii) — (iii)] 

(v) New sums [(i) + (iii)] 

(vi) New averages [y, = (v)/n] 

84.5 84.2 84.9 84.5 

84.5 84.2 84.9 84.5 

Range of (iv) = 
84.3 New sum S = 

84.3 New average S = 

New sum S 

Calculation ( 

rf Effects 

Calculation of Error Limits 

Temperature effect = j(y 3 + y 4 

Pressure effect = 5(5*3 + y 5 — y 2 
T/P interaction effect = j(y 2 H 

Change-in-mean effect = ^(y 2 + 

- y-i ~ b) = 0.45 

: - y 4 ) = 0.25 

1- B ~ B ~ B) = 0.15 

B + B + B - 4y,) = 0.02 

2 

For new average = —y= S = 

V n 

For new effects —p S = 

Vn 

- , • 1-78 „ 

For change in mean S = 
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Table 11-18 EVOP Calculation Sheet for Example 11-5, n = 2 


5 H 3 

2i -I4 


Cycle: n = 2 
Response: Yield 


Phase: 1 

Date: 1/11/00 



Calculation of Averages 


Calculation of 
Standard 
Deviation 

Operating Conditions 

0) 

(2) 

(3) 

(4) 

(5) 


(i) Previous cycle sum 

84.5 

84.2 

84.9 

84.5 

84.3 

Previous sum S = 

(ii) Previous cycle average 

84.5 

84.2 

84.9 

84.5 

84.3 

Previous average S = 

(iii) New observations 

84.9 

84.6 

85.9 

83.5 

84.0 

New S = range X 

A. = 0.60 

(iv) Differences [(ii) — (iii)] 

-0.4 

-0.4 

-1.0 

+ 1.0 

0.3 

Range of (iv) = 2.0 

(v) New sums [(i) -1- (iii)] 

169.4 

168.8 

170.8 

168.0 

168.3 

New sum S = 0.60 

(vi) New averages [y, = (v)/n] 

84.70 

84.40 

85.40 

84.00 

84.15 

New average 5 = 

New sum S „ „„ 

-j— = 0.60 

Calculation of Effects 



Calculation of Error Limits 


Temperature effect = j(y 3 + y 4 — y 2 — y 5 ) = 0.43 


e effect = j(y 3 + y 3 - y 2 - yd = 0.58 
TXP interaction effect = j(y 2 + y 3 ~ ~ ys) = 0.83 

Change-in-mean effect = %(y 2 + y 3 + y 4 + y 5 - 4y,) = 
-0.17 


For new average = —7= S = 0.85 
Vh 

2 

For new effects —7= S = 0.85 
Vn 

1.78 

For change in mean -^= S = 0.76 


made. The effects and interaction for temperature and pressure are calculated in the usual 
manner for a 2 2 design. 

A second cycle is then run and the yield data entered in another EVOP calculation 
sheet, shown in Table 11-18. At the end of the second cycle, the experimental error can 
be estimated and the estimates of the effects compared to approximate 95 percent (two 
standard deviation) limits. Note that the range refers to the range of the differences in 
row (iv); thus the range is +1.0 - (—1.0) = 2.0. Because none of the effects in Table 
11-18 exceed their error limits, the true effect is probably zero, and no changes in op¬ 
erating conditions are contemplated. 

The results of a third cycle are shown in Table 11-19 on the facing page. The effect 
of pressure now exceeds its error limit and the temperature effect is equal to the error 
limit. A change in operating conditions is now probably justified. 

In light of the results, it seems reasonable to begin a new EVOP phase about point 
(3). Thus, jci = 225°F and x 2 = 150 psi would become the center of the 2 2 design in the 
second phase. 

An important aspect of EVOP is feeding the information generated back to the 
process operators and supervisors. T hi s is accomplished by a prominently displayed 
EVOP information board. The in formation board for this example at the end of cycle 3 
is shown in Table 11-20 on the facing page. 
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Table 11-19 EVOP Calculation Sheet for Example 11-5, n = 3 


5 0 3 


Cycle: n = 

3 


Phase: 1 


Response: Yield 


Date: 1/11/00 






Calculation of 






Standard 



Calculation of Averages 


Deviation 

Operating Conditions 

(1) 

(2) (3) 

(4) 

(5) 


(i) Previous cycle sum 

169.4 

168.8 170.8 

168.0 

168.3 

Previous sum S = 0.60 

(ii) Previous cycle average 

84.70 

84.40 85.40 

84.00 

84.15 

Previous average S = 






0.60 

(iii) New observations 

85.0 

84.0 86.6 

84.9 

85.2 

New S = range X 






/s.„ = 0.56 

(iv) Differences [(ii) - (iii)] 

-0.30 

+0.40 -1.20 

-0.90 

-1.05 

Range of (iv) = 1.60 

(v) New sums [(i) + (iii)] 

254.4 

252.8 257.4 

252.9 

253.5 

New sum S = 1.16 

(vi) New averages [y, = (v)/n] 

84.80 

84.27 85.80 

84.30 

84.50 

New average S = 






New sum S „ „„ 






—-- - — = 0.58 

Calculation of Effects 



Calculation of Error Limits 

Temperature effect = |(y 3 + y 4 

-yi~ 

y 5 ) = 0.67 

For new average = S = 0.67 

Pressure effect = y(v 3 + y, — y 

z -y 4 ) 

= 0.87 




T X P interaction effect = j(y 2 ■ 
0.64 


- Ys) = 

For new effects —7= S = 0.67 

VH 

Change-in-mean effect = j(y 2 + 
= -0.07 

yi + y* + ys - 4y0 

For change 

1.78 

in mean —7= S = 0.60 
\Tn 


Table 11-20 EVOP Information Board, Cycle 3 


Response: Percent Yield 


Reqi 

uirement: Maximize 


150 

84.50 

85.80 

£ 

1 145 

_ 


£ 



140 

P 84.27 

1 1 

84.30 

1 


245 250 

255 


Temperature 



Error Limits for Averages: 

±0.67 


Effects with Temperature 0.67 ±0.67 

95 percent error Pressure 0.87 ±0.67 

Limits: T X P 0.64 ±0.67 


Change in mean 0.07 ±0.60 

0.58 


Standard deviation: 
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Table 11-21 Values of f kn 

n = _2_3_4_5_6_7_8_ 9 10 

k = 5 0.30 0.35 0.37 0.38 0.39 0.40 0.40 0.40 0.41 

9 0.24 0.27 0.29 0.30 0.31 0.31 0.31 0.32 0.32 

10 0.23 0.26 0.28 0.29 0.30 0.30 0.30 0.31 0.31 


Most of the quantities on the EVOP calculation sheet follow directly from the 
analysis of the 2* factorial design. For example, the variance of any effect, such as 
j(y 3 + y 5 ~ y-i — %), is simply a 2 /n where a 2 is the variance of the observations (y). 
Thus, two standard deviation (corresponding to 95 percent) error limits on any effect 
would be ±2al\fn. The variance of the change in mean is 


V(CIM) 


-y[\ 


(y 2 + y3 + y4 + y 5 ~ 4 yd 


-m 


Thus, two standard deviation error limits on the CIM are ± (2V20/25)(t/Vn = 
±1.78tr/V«. 

The standard deviation cr is estimated by the range method. Let y, (n) denote the 
observation at the ith design point in cycle n and y,(n) denote the corresponding average 
of y t (n) after n cycles. The quantities in row (iv) of the EVOP calculation sheet are the 
differences y,(«) — y,(n — 1). The variance of these differences is 


nyM - 5M n - 1)] - [l + 5TT1)] = jr=T) 


The range of the differences, say R D , is related to the estimate of the standard deviation 
of the differences by & D = R D /d 2 . The fa ctor d 2 dep ends on the number of observations 
used in computing R D . Now R D /d 2 = V n/(n — 1), so 




can be used to estimate the standard deviation of the observations, where k denotes the 
number of points used in the design. For a 2 2 design with one center point we have k = 
5, and for a 2 3 design with one center point we have k = 9. Values of f kn are given in 
Table 11-21. 


11-7 ROBUST DESIGN 
11-7.1 Background 

Throughout this book, we have stressed the importance of using statistically designed 
experiments for product and process design, development, and improvement. Since about 
1980, engineers and scientists have become increasingly aware of the benefits of using 
designed experiments, and as a consequence, there have been many new application 
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areas. One of the most important of these is in robust design, where the focus is on one 
or more of the following: 

1. Designing systems (products or processes) that are insensitive to environmental 
factors that can affect performance once the system is deployed in the field. An 
example is the formulation of an exterior paint that should exhibit long life when 
exposed to a variety of weather conditions. Because weather conditions are not 
entirely predictable, and are certainly not constant, the product formulator wants 
the product to be robust against a wide range of temperature, humidity, and 
precipitation factors that affect the wear and finish of the paint. 

2. Designing products so that they are insensitive to variability transmitted by the 
components of the system. An example is designing an electronic amplifier so 
that the output voltage is as close as possible to the desired target regardless of 
the variability in the electrical parameters of the resistors, transistors, and power 
supplies that are the components of the device. 

3. Designing processes so that the manufactured product will be as close as possible 
to the desired target specifications even though some process variables (such as 
temperature) or raw material characteristics are impossible to control precisely. 

4. Determining the operating conditions for a process so that critical product 
characteristics are as close as possible to the desired target value and the 
variability around this target is minimized. Examples of this type of problem 
occur frequently. For instance in semiconductor manufacturing we would like 
the oxide thickness on a wafer to be as close as possible to the target mean 
thickness, and we would also like for the variability in thickness across the wafer 
(a measure of uniformity) to be as small as possible. 

In the early 1980s, a Japanese engineer, Dr. Genichi Taguchi, introduced an approach 
to solving these types of problems, which he collectively referred to as the robust pa¬ 
rameter design (RPD) problem [see Taguchi and Wu (1980), Taguchi (1987, 1991)]. 
His approach was based on classifying the variables in a process or product as either 
control (or controllable) variables or noise (or uncontrollable) variables and then 
finding the settings for the controllable variables that minimized the variability trans¬ 
mitted to the response from the uncontrollable variables. We make the assumption that 
although the noise factors are uncontrollable in the full-scale system, they can be con¬ 
trolled for purposes of an experiment. Refer to Figure 1-1 for a graphical view of con¬ 
trollable and uncontrollable variables in the general context of a designed experiment. 

Taguchi introduced some novel statistical methods and some variations on estab¬ 
lished techniques as part of his RPD procedure. He made use of highly fractionated 
factorial designs and other types of fractional designs obtained from orthogonal arrays. 
His methodology generated considerable debate and controversy. Part of the controversy 
arose because Taguchi’s methodology was advocated in the West initially (and primarily) 
by entrepreneurs, and the underlying statistical science had not been adequately peer- 
reviewed. By the late 1980s the results of a very comprehensive peer review indicated 
that although Taguchi’s engineering concepts and the overall objective of RPD were 
well-founded, there were substantial problems with his experimental strategy and meth¬ 
ods of data analysis. For specific details of these issues, see Box (1988), Box, Bisgaard, 
and Fung (1988), Hunter (1985) (1989), Montgomery (1999), Myers and Montgomery 
(1995), and Pignatiello and Ramberg (1992). Many of these concerns are also summa- 
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rized in the extensive panel discussion in the May 1992 issue of Technometrics [see Nair 
et al. (1992)]. The supplemental technical material for this chapter also discusses and 
illustrates many of the problems underlying Taguchi’s technical methods. 

Taguchi’s methodology for the RPD problem revolves around the use of an orthog¬ 
onal design for the controllable factors that is “crossed” with a separate orthogonal 
design for the noise factors. Table 11-22 (page 490) presents an example from Byrne 
and Taguchi (1987) that involved the development of a method to assemble an elasto- 
metric connector to a nylon tube that would deliver the required pull-off force. There are 
four controllable factors, each at three levels (A = interference, B = connector wall 
thickness, C = insertion depth, D = percent adhesive), and three noise or uncontrollable 
factors, each at two levels (E = conditioning time, F = conditioning temperature, G = 
conditioning relative humidity). Panel (a) of Table 11-22 contains the design for the 
controllable factors. Notice that the design is a three-level fractional factorial; specifi¬ 
cally, it is a 3 4-2 design. Taguchi calls this the inner array design. Panel (b) of Table 
11-22 contains a 2 3 design for the noise factors, which Taguchi calls the outer array 
design. Now each run in the inner array is performed for all treatment combinations in 
the outer array, producing the 72 observations on pull-off force shown in the table. This 
type of design is called a crossed array design. 

Taguchi suggested that we summarize the data from a crossed array experiment with 
two statistics: the average of each observation in the inner array across all runs in the 
outer array and a summary statistic that attempted to combine information about the 
mean and variance, called the signal-to-noise ratio. These signal-to-noise ratios are 
purportedly defined so that a maximum value of the ratio minimized variability trans¬ 
mitted from the noise variables. Then an analysis is performed to determine which set¬ 
tings of the controllable factors result in (1) the mean as close as possible to the desired 
target and (2) a maximum value of the signal-to-noise ratio. 

Examination of Table 11-22 reveals a major problem with the Taguchi design strat¬ 
egy; namely, the crossed array approach will lead to a very large experiment. In our 
example, there are only seven factors, yet the design has 72 runs. Furthermore, the inner 
array design is a 3 4-2 resolution III design (see Chapter 9 for discussion of this design), 
so in spite of the large number of runs, we cannot obtain any information about inter¬ 
actions among the controllable variables. Indeed, even information about the main effects 
is potentially tainted because the main effects are heavily aliased with the two-factor 
interactions. It also turns out that the Taguchi signal-to-noise ratios are problematic; 
maximizing the ratio does not necessarily minimize variability. Refer to the supplemental 
text material for more details. 

An important point about the crossed array design is that it does provide information 
about controllable factor X noise factor interactions. These interactions are crucial to the 
solution of a RPD problem. For example, consider the two-factor interaction graphs in 
Figure 11-41 (page 492), where x is the controllable factor and z is the noise factor. In 
Figure 11 -4 la, there is no x X z interaction; therefore, there is no setting for the con¬ 
trollable variable x that will affect the variability transmitted to the response by the 
variability in z. However, in Figure ll-41b, there is a strong x X z interaction. Note that 
when x is set to its low level, there is much less variability in the response variable than 
when x is at the high level. Thus, unless there is at least one controllable factor X noise 
factor interaction, there is no robust design problem. As we will see in the next section, 
focusing on identifying and modeling these interactions is one of the keys to a more 
efficient and effective approach to RPD. 
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Figure 11-41 The role of the control X noise interaction in robust design. 


11-7.2 The Response Surface Approach to Robust Design 

As noted in the previous section, interactions between controllable and noise factors are 
the key to a robust design problem. Therefore, it is logical to use a model for the response 
that includes both controllable and noise factors and their interactions. To illustrate, 
suppose that we have two controllable factors x x and x 2 and a single noise factor z x . We 
assume that both control and noise factors are expressed as the usual coded variables 
(that is, they are centered at zero and have lower and upper limits at ±a). If we wish to 
consider a first-order model involving the controllable variables, a logical model is 

y = 1 3 0 + PiXi + J 82 T 2 + P12X1X2 + y\Z\ + &nX\Z\ + S 2 iX 2 z 1 + € (11-28) 

Notice that this model has the main effects of both controllable factors their interaction, 
the main effect of the noise variable, and both interactions between the controllable and 
noise variables. This type of model, incorporating both controllable and noise variables, 
is often called a response model. Unless at least one of the regression coefficients 5 n 
and <5 21 is nonzero, there will be no robust design problem. 

An important advantage of the response model approach is that both the controllable 
factors and the noise factors can be placed in a single experimental design; that is, the 
inner and outer array structure of the Taguchi approach can be avoided. We usually call 
the design containing both controllable and noise factors a combined array design. 

As mentioned previously, we assume that noise variables are random variables, al¬ 
though they are controllable for purposes of an experiment. Specifically, we assume that 
the noise variables are expressed in coded units, that they have expected value zero, 
variance a 2 , and if there are several noise variables, they have zero covariances. Under 
these assumptions, it is easy to find a model for the mean response just by taking the 
expected value of y in Equation 11-28. This yields 

E z (y) = Po + P\Xi + P2X2 + P12X1X2 

where the z subscript on the expectation operator is a reminder to take expected value 
with respect to both random variables in Equation 11-28, z, and e. To find a model for 
the variance of the response y, we use the transmission of error approach. First, expand 
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the response model Equation 11-28 in a first-order Taylor series around z, = 0. This 
gives 


y = y z = 0 + -f - (zi ~ 0) + R + e 

dzi 

= Po + PiXi + /3 2 x 2 + p l2 xix 2 
+ (7i + S, ] jc, + S 21 -r 2 )zi + 7? + e 

where /? is the remainder term in the Taylor series. As is the usual practice, we will 
ignore the remainder term. Now the variance of y can be obtained by applying the vari¬ 
ance operator across this last expression (without R). The resulting variance model is 

V z (y) = a 2 z (yi + 8 n x t + ^ X x 2 ) 2 + a 2 

Once again, we have used the z subscript on the variance operator as a reminder that 
both Zj and e are random variables. 

We have derived simple models for the mean and variance of the response variable 
of interest. Note the following: 

1. The mean and variance models involve only the controllable variables. This 
means we can potentially set the controllable variables to achieve a target value 
of the mean and minimize the variability transmitted by the noise variable. 

2. Although the variance model involves only the controllable variables, it also 
involves the interaction regression coefficients between the controllable and 
noise variables. This is how the noise variable influences the response. 

3. The variance model is a quadratic function of the controllable variables. 

4. The variance model (apart from a 2 ) is just the square of the slope of the fitted 
response model in the direction of the noise variable. 

To use these models operationally, we would 

1. Perform an experiment and fit an appropriate response model, such as Equation 
11-28 

2. Replace the unknown regression coefficients in the mean and variance models 
with their least squares estimates from the response model and replace a 2 in the 
variance model by the residual mean square found when fitting the response 
model 

3. Optimize the mean and variance model using the standard multiple response 
optimization methods discussed in Section 11-3.4 

It is very easy to generalize these results. Suppose that there are k controllable vari¬ 
ables and r noise variables. We will write the general response model involving these 
variables as 


y(x, z) = f(x) + h(x, z) + e (11-29) 

where /(x) is the portion of the model that involves only the controllable variables and 
h(x, z) are the terms involving the main effects of the noise factors and the interactions 
between the controllable and noise factors. Typically, the structure for h(x, z) is 
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h(x, Z) = 2 Ji z i + 2 2 

The structure for/(x) will depend on what type of model for the controllable variables 
the experimenter thinks is appropriate. The logical choices are the first-order model with 
interaction and the second-order model. If we assume that the noise variables have mean 
zero, variance a 2 , and zero covariances, and that the noise variables and the random 
errors e have zero covariances, then the mean model for the response is just 

£ z [y(x, z>] = f(x) (11-30) 


and the variance model for the response is 
V z {y(x, z)] 

Myers and Montgomery (1995) give a slightly more general form for Equation (11-31) 
based on applying a conditional variance operator directly to the response model. 


2 ^ 9y{x, z) j 


EXAMPLE 11-6 . 

To illustrate the foregoing procedure, reconsider Example 6-2 in which four factors were 
studied in a 2 4 factorial design to investigate their effect on the filtration rate of a chemical 
product. We will assume that factor A, temperature, is hard to control in the full-scale 
process, but it can be controlled during the experiment (which was performed in a pilot 
plant). The other three factors, pressure (B), concentration (C), and stirring rate (D), are 
easy to control. Thus the noise factor Zi is temperature, and the controllable variables * b 
x 2 , and * 3 are pressure, concentration, and stirring rate, respectively. Because both the 
controllable factors and the noise factor are in the same design, the 2 4 factorial design 
used in this experiment is an example of a combined array design. 

Using the results from Example 6-2, the response model is 



= 70.06 + 10.81Z! + 4.94*2 + 7.31* 3 - 9.06* 2 Zi + 8.31*^ 

Using Equations (11-30) and (11-31), we can find the mean and variance models as 
£ z [y(x, zi)] = 70.06 + 4.94* 2 + 7.31* 3 
and 

U z [y(x, z,)] = 0-2(10.81 - 9.06*2 + 8.31* 3 ) 2 + a 2 

= a 2 z ( 116.91 + 82.08*1 + 69.06*| - 195.88* 2 
+ 179.66* 3 - 150.58* 2 * 3 ) + a 2 

respectively. Now assume that the low and high levels of the noise variable temperature 
have been run at one standard deviation either side of its typical or average value, so that 
cr? = 1, and use & 2 = 19.51 (this is the residual mean square obtained by fitting the 
response model). Therefore, the variance model becomes 
V-[y(x, z : )] = 136.42 - 195.88* 2 + 179.66* 3 - 150.58* 2 * 3 + 82.08*1 + 69.06*1 
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x 4 = Strirring rate 

Figure 11-42 Contours of constant mean filtration rate. Example 11-6, with z, = tem¬ 
perature = 0. 


Figure 11-42 presents a contour plot from the Design-Expert software package of 
the response contours from the mean model. To construct this plot, we held the noise 
factor (temperature) at zero and the nonsignificant controllable factor (pressure) at zero. 
Notice that mean filtration rate increases as both concentration and stirring rate increase. 
Design-Expert will also construct plots automatically of the square root of the variance 
contours, which it labels propagation of error, or POE. Obviously, the POE is just the 
standard deviation of the transmitted variability in the response as a function of the 
controllable variables. Figure 11-43 (page 496) shows a contour plot and a three-dimen¬ 
sional response surface plot of the POE, obtained from Design-Expert (in this plot the 
noise variable is held constant at zero, as explained previously). 

Suppose that the experimenter wants to maintain a mean filtration rate about 75 and 
minimize the variability around this value. Figure 11-44 (page 497) shows an overlay 
plot of the contours of mean filtration rate and the POE as a function of concentration 
and stirring rate, the significant controllable variables. To achieve the desired objectives, 
it will be necessary to hold concentration at the high level and stirring rate very near the 
middle level. 


Example 11-6 illustrates the use of a first-order model with interaction as the model 
for the controllable factors,/(x). We now present an example adapted from Montgomery 
(1999) that involves a second-order model. 
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x 4 = Strirring rate 


Figure 11-44 Overlay plot of mean and POE contours for filtration rate. Example 11- 
6, with Zi temperature = 0. 

EXAMPLE 11-7 . 

An experiment was run in a semiconductor manufacturing facility involving two con¬ 
trollable variables and three noise variables. The combined array design used by the 
experimenters is shown in Table 11-23 on the next page. The design is a 23-run variation 
of a central composite design that was created by starting with a standard CCD for five 
factors (the cube portion is a 2 5_1 ) and deleting the axial runs associated with the three 
noise variables. This design will support a response model that has a second-order model 
in the controllable variables, the main effects of the three noise variables, and the inter¬ 
actions between the control and noise factors. The fitted response model is 
y(x, z) = 30.37 - 2.92*i - 4.13* 2 + 2.60*? + 2.18*2 + 2.87x,x 2 

+ 2.73z, - 2.33i 2 + 2.33z 3 - 0.27*iZi + 0.89* 1 z 2 + 2.58x,z 3 
+ 2.01* 2 z, - 1.43* 2 z 2 + 1.56* 2 z 3 
The mean and variance models are 

E z [y(x, z)] = 30.37 - 2.92*! - 4.13* 2 + 2.60*? + 2.18*1 + 2.87*!* 2 
and 

V z [y(x, z)] = 19.26 + 3.20*! + 12.45*2 + 7.52*? + 8.52*| + 2.21*!* 2 
where we have substituted parameter estimates from the fitted response model into the 
equations for the mean and variance models and, as in the previous example, assumed 
that <j\= 1. Figures 11-45 and 11-46 (pages 498-499) (from Design-Expert) present 
contour plots of the process mean and POE (remember POE is the square root of the 
variance response surface) generated from these models. 
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Table 11-23 Combined Array Experiment with Two Controllable Variables 



and Three Noise Variables, Example 

11-7 



Run Number 

*1: 

x 2 

Zi 

Z 2 

Z 3 

y 

1 

-1.00 

-1.00 

-1.00 

-1.00 

1.00 

44.2 

2 

1.00 

-1.00 

-1.00 

-1.00 

-1.00 

30.0 

3 

-1.00 

1.00 

-1.00 

-1.00 

-1.00 

30.0 

4 

1.00 

1.00 

-1.00 

-1.00 

1.00 

35.4 

5 

-1.00 

-1.00 

1.00 

-1.00 

-1.00 

49.8 

6 

1.00 

-1.00 

1.00 

-1.00 

1.00 

36.3 

7 

-1.00 

1.00 

1.00 

-1.00 

1.00 

41.3 

8 

1.00 

1.00 

1.00 

-1.00 

-1.00 

31.4 

9 

-1.00 

-1.00 

-1.00 

1.00 

-1.00 

43.5 

10 

1.00 

-1.00 

-1.00 

1.00 

1.00 

36.1 

11 

-1.00 

1.00 

-1.00 

1.00 

1.00 

22.7 

12 

1.00 

1.00 

-1.00 

1.00 

-1.00 

16.0 

13 

-1.00 

-1.00 

1.00 

1.00 

1.00 

43.2 

14 

1.00 

-1.00 

1.00 

1.00 

-1.00 

30.3 

15 

-1.00 

1.00 

1.00 

1.00 

-1.00 

30.1 

16 

1.00 

1.00 

1.00 

1.00 

1.00 

39.2 

17 

-2.00 

0.00 

0.00 

0.00 

0.00 

46.1 

18 

2.00 

0.00 

0.00 

0.00 

0.00 

36.1 

19 

0.00 

-2.00 

0.00 

0.00 

0.00 

47.4 

20 

0.00 

2.00 

0.00 

0.00 

0.00 

31.5 

21 

0.00 

0.00 

0.00 

0.00 

0.00 

30.8 

22 

0.00 

0.00 

0.00 

0.00 

0.00 

30.7 

23 

0.00 

0.00 

0.00 

0.00 

0.00 

31.0 



Figure 11-45 Contour plot of the mean model. Example 11-7. 
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In this problem it is desirable to keep the process mean below 30. From inspection 
of Figures 11-45 and 11-46 it is clear that some trade-off will be necessary if we wish 
to make the process variance small. Because there are only two controllable variables, a 
logical way to accomplish this trade-off is to overlay the contours of constant mean 
response and constant variance, as shown in Figure 11-47 on the previous page. This 
plot shows the contours for which the process mean is less than or equal to 30 and the 
process standard deviation is less than or equal to 5. The region bounded by these con¬ 
tours would represent a typical operating region of low mean response and low process 
variance. 


11-8 PROBLEMS - 

11-1. A chemical plant produces oxygen by liquifying air and separating it into its component 
gases by fractional distillation. The purity of the oxygen is a function of the main con¬ 
denser temperature and the pressure ratio between the upper and lower columns. Current 
operating conditions are temperature (&) = —220°C and pressure ratio (£ 2 ) = 1.2. Using 
the following data, find the path of steepest ascent: 


Temperature (£,) 

Pressure Ratio (f 2 ) 

Purity 

-225 

1.1 

82.8 

-225 

1.3 

83.5 

-215 

1.1 

84.7 

-215 

1.3 

85.0 

-220 

1.2 

84.1 

-220 

1.2 

84.5 

-220 

1.2 

83.9 

-220 

1.2 

84.3 


11-2. An industrial engineer has developed a computer simulation model of a two-item inven¬ 
tory system. The decision variables are the order quantity and the reorder point for each 
item. The response to be minimized is total inventory cost. The simulation model is used 
to produce the data shown in the following table. Identify the experimental design. Find 
the path of steepest descent. 


Item 1 

Item 2 

Total 

Cost 

Order 

Quantity (£,) 

Reorder 

Point (f 2 ) 

Order 

Quantity (fe) 

Reorder 
Point (&) 

100 

25 

250 

40 

625 

140 

45 

250 

40 

670 

140 

25 

300 

40 

663 

140 

25 

250 

80 

654 

100 

45 

300 

40 

648 

100 

45 

250 

80 

634 

100 

25 

300 

80 

692 

140 

45 

300 

80 

686 

120 

35 

275 

60 

680 

120 

35 

275 

60 

674 

120 

35 

275 

60 

681 




11-8 PROBLEMS 


501 


11-3. Verify that the following design is a simplex. Fit the first-order model, and find the path 
of steepest ascent. 


*1 

*2 

*3 

y 

0 

V2 

-1 

18.5 

-V2 

0 

1 

19.8 

0 

—V2 

-1 

17.4 

V2 

0 

1 

22.5 


11-4. For the first-order model 

y = 60 + 1.5*! - 0.8*2 + 2.0*3 

find the path of steepest ascent. The variables are coded as — 1 =£ *, =s 1. 

11-5. The region of experimentation for three factors are time (40 =£ T, 80 min), temperature 
(200 =£ J 2 « 300°C), and pressure (20 =S P =S 50 psig). A first-order model in coded 
variables has been fit to yield data from a 2 3 design. The model is 

y = 30 + 5*! + 2.5*2 + 3.5*3 

Is the point T, = 85, T 2 = 325, P = 60 on the path of steepest ascent? 

11-6. The region of experimentation for two factors are temperature (100 =£ T =£ 300°F) and 
catalyst feed rate (10 =£ C 30 lb/in). A first-order model in the usual ± 1 coded variables 
has been fit to a molecular weight response, yielding the following model: 

$ = 2000 + 125*, + 40*2 

(a) Find the path of steepest ascent. 

(b) It is desired to move to a region where molecular weights are above 2500. Based 
on the information you have from experimentation in this region, about how many 
steps along the path of steepest ascent might be required to move to the region of 
interest? 

11-7. The path of steepest ascent is usually computed assuming that the model is truly first- 
order; that is, there is no interaction. However, even if there is interaction, steepest ascent 
ignoring the interaction still usually produces good results. To illustrate, suppose that we 
have fit the model 

y = 20 + 5*, - 8*2 + 3*,* 2 
using coded variables (— 1 =£*, =£ +1). 

(a) Draw the path of steepest ascent that you would obtain if the interaction were 
ignored. 

(b) Draw the path of steepest ascent that you would obtain with the interaction included 
in the model. Compare this with the path found in part (a). 

11-8. The data shown in the following table were collected in an experiment to optimize crystal 
growth as a function of three variables *„ * 2 , and * 3 . Large values of y (yield in grams) 
are desirable. Fit a second-order model and analyze the fitted surface. Under what set of 
conditions is maximum growth achieved? 
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X\ _*2_ *3_ 


-1.682 0 

1.682 0 

0 -1.682 
0 1.682 

0 0 

0 0 

0 0 

0 0 

0 0 

0 0 

0 0 

0 0 


0 

0 

0 

0 

-1.682 

1.682 

0 

0 

0 

0 

0 

0 


>' 

66 

70 

78 

60 

80 

70 

100 

75 

100 

80 

68 

63 

65 

82 

113 

100 

118 

88 

100 

85 


11-9. The following data were collected by a chemical engineer. The response y is filtration 
time, Xj is temperature, and x 2 is pressure. Fit a second-order model. 


*i _ xi _ y_ 

-1 -1 54 

-1 1 45 

1 -1 32 

1 1 47 

-1.414 0 50 

1.414 0 53 

0 -1.414 47 

0 1.414 51 

0 0 41 

0 0 39 

0 0 44 

0 0 42 

0 0 40 


(a) What operating conditions would you recommend if the objective is to minimize 
the filtration time? 

(b) What operating conditions would you recommend if the objective is to operate the 
process at a mean filtration rate very close to 46? 

11-10. The hexagon design that follows is used in an experiment that has the objective of fitting 
a second-order model: 
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Xi 

*2 

y 

1 

0 

68 

0.5 

V0J5 

74 

-0.5 

V0?75 

65 

-1 

0 

60 

-0.5 

—Vo/75 

63 

0.5 

—V0J5 

70 

0 

0 

58 

0 

0 

60 

0 

0 

57 

0 

0 

55 

0 

0 

69 


(a) Fit the second-order model. 

(b) Perform the canonical analysis. What type of surface has been found? 

(c) What operating conditions on x x and x 2 lead to the stationary point? 

(d) Where would you run this process if the objective is to obtain a response that is as 
close to 65 as possible? 

11-11. An experimenter has run a Box-Behnken design and has obtained the results below, 
where the response variable is the viscosity of a polymer: 


Level 

Temp. 

Agitation 

Rate 

Pressure 

Xl 

x 2 

*3 

High 

200 

10.0 

25 

+ 1 

+ 1 

+ 1 

Middle 

175 

7.5 

20 

0 

0 

0 

Low 

150 

5.0 

15 

-1 

-1 

-1 


Run x, x 2 


2l 


2 

3 

4 

5 

6 

7 

8 
9 

10 

11 

12 

13 

14 

15 


-1 -1 

+1 -1 

-1 +1 

+ 1 +1 

-1 0 

+ 1 0 

-1 0 

+ 1 0 

0 -1 

0 +1 

0 -1 

0 +1 

0 0 

0 0 

0 0 


0 

0 

0 

0 


0 

0 

0 


535 

580 

596 

563 

645 

458 

350 

600 

595 

648 

532 

656 

653 

599 

620 


(a) Fit the second-order model. 

(b) Perform the canonical analysis. What type of surface has been found? 
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(c) What operating conditions on x,, x 2 , and x 3 lead to the stationary point? 

(d) What operating conditions would you recommend if it is important to obtain a 
viscosity that is as close to 600 as possible? 

11-12. Consider the three-variable central composite design shown below. Analyze the data and 
draw conclusions, assuming that we wish to maximize conversion (_y,) with activity (y 2 ) 
between 55 and 60. 


Run 

Time 

Temperature 

(°C) 

Catalyst 

(%) 

Conversion 

(%) 

yi 

Activity 

?2 

1 

-1.000 

-1.000 

-1.000 

74.00 

53.20 

2 

1.000 

-1.000 

-1.000 

51.00 

62.90 

3 

-1.000 

1.000 

-1.000 

88.00 

53.40 

4 

1.000 

1.000 

-1.000 

70.00 

62.60 

5 

-1.000 

-1.000 

1.000 

71.00 

57.30 

6 

1.000 

-1.000 

1.000 

90.00 

67.90 

7 

-1.000 

1.000 

1.000 

66.00 

59.80 

8 

1.000 

1.000 

1.000 

97.00 

67.80 

9 

0.000 

0.000 

0.000 

81.00 

59.20 

10 

0.000 

0.000 

0.000 

75.00 

60.40 

11 

0.000 

0.000 

0.000 

76.00 

59.10 

12 

0.000 

0.000 

0.000 

83.00 

60.60 

13 

-1.682 

0.000 

0.000 

76.00 

59.10 

14 

1.682 

0.000 

0.000 

79.00 

65.90 

15 

0.000 

-1.682 

0.000 

85.00 

60.00 

16 

0.000 

1.682 

0.000 

97.00 

60.70 

17 

0.000 

0.000 

-1.682 

55.00 

57.40 

18 

0.000 

0.000 

1.682 

81.00 

63.20 

19 

0.000 

0.000 

0.000 

80.00 

60.80 

20 

0.000 

0.000 

0.000 

91.00 

58.90 


11-13. A manufacturer of cutting tools has developed two empirical equations for tool life in 
hours (y,) and for tool cost in dollars (y 2 ). Both models are linear functions of steel 
hardness (jq) and manufacturing time (x 2 ). The two equations are 

yj = 10 + 5*! + 2 x 2 
y 2 = 23 + 3jc, + 4x 2 

and both equations are valid over the range —1.5 *£ x, < 1.5. Unit tool cost must be 
below $27.50 and life must exceed 12 hours for the product to be competitive. Is there 
a feasible set of operating conditions for this process? Where would you recommend 
that the process be run? 

11-14. A central composite design is run in a chemical vapor deposition process, resulting in 
the experimental data shown on the next page. Four experimental units were processed 
simultaneously on each run of the design, and the responses are the mean and variance 
of thickness, computed across the four units. 
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X, _*2_ l _ 

-1 -1 360.6 

1 -1 445.2 

-1 1 412.1 

1 1 601.7 

1.414 0 518.0 

-1.414 0 411.4 

0 1.414 497.6 

0 -1.414 397.6 

0 0 530.6 

0 0 495.4 

0 0 510.2 

0 0 487.3 


6.689 

14.230 

7.088 

8.586 

13.130 

6.644 

7.649 

11.740 

7.836 

9.306 

7.956 

9.127 


(a) Fit a model to the mean response. Analyze the residuals. 

(b) Fit a model to the variance response. Analyze the residuals. 

(c) Fit a model to ln(s 2 ). Is this model superior to the one you found in part (b)? 

(d) Suppose you want the mean thickness to be in the interval 450 ± 25. Find a set of 
operating conditions that achieve this objective and simultaneously minimize the 
variance. 

(e) Discuss the variance minimization aspects of part (d). Have you minimized the 
total process variance? 

11-15. Verify that an orthogonal first-order design is also first-order rotatable. 

11-16. Show that augmenting a 2* design with n c center points does not affect the estimates of 
the ^ (i = 1, 2,..., k) but that the estimate of the intercept j8 0 is the average of all 
2* + n c observations. 

11-17. The rotatable central composite design. It can be shown that a second-order design is 
rotatable if 2"= = 0 if a or b (or both) are odd and if 2"= ixf u = 32”= ixj u x%. Show 

that for the central composite design these conditions lead to a = (n F ) U4 for rotatability, 
where n F is the number of points in the factorial portion. 

11-18. Verify that the central composite design shown below blocks orthogonally: 


Block 1 


Block 2 


Block 3 


Xi _*2_ x 3 

0 0 0 

0 0 0 

1 1 1 

1 -1 -1 

-1 -1 1 

-1 1 -1 


0 0 0 

0 0 0 

1 1 -1 

1 -1 1 

-1 1 1 

-1 -1 -1 


*1 _ X 2 

-1.633 0 

1.633 0 

0 -1.633 

0 1.633 

0 0 

0 0 

0 0 

0 0 


*3 

0 

0 

0 

0 

-1.633 

1.633 

0 

0 


11-19. Blocking in the central composite design. Consider a central composite design for k = 
4 variables in two blocks. Can a rotatable design always be found that blocks 
orthogonally? 

11-20. How could a hexagon design be run in two orthogonal blocks? 
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11-21. Yield during the first four cycles of a chemical process is shown in the following table. 
The variables are percentage of concentration (*,) at levels 30, 31, and 32 and temper¬ 
ature (x 2 ) at 140, 142, and 144T. Analyze by EVOP methods. 


Cycle 



Conditions 



0) 

(2) 

(3) 

(4) 

(5) 

1 

60.7 

59.8 

60.2 

64.2 

57.5 

2 

59.1 

62.8 

62.5 

64.6 

58.3 

3 

56.6 

59.1 

59.0 

62.3 

61.1 

4 

60.5 

59.8 

64.5 

61.0 

60.1 


11-22. Suppose that we approximate a response surface with a model of order d u such as y - 
X,/3, + e, when the true surface is described by a model of order d 2 > d-c, that is, 
E{ y) = XA + X.ft, 

(a) Show that the regression coefficients are biased, that is, that E(fii) - Pi + A p 2 , 
where A = (X ] 'Xi)^ 1 X;X 2 . A is usually called the alias matrix. 

(b) If di = 1 and d 2 = 2, and a full 2* is used to fit the model, use the result in part 
(a) to determine the alias structure. 

(c) If di = 1, d 2 = 2, and k = 3, find the alias structure assuming that a 2 3-1 design 
is used to fit the model. 

(d) If d x = 1, d 2 = 2, k = 3, and the simplex design in Problem 11-3 is used to fit the 
model, determine the alias structure and compare the results with part (c). 

11-23. In an article (“Let’s All Beware the Latin Square,” Quality Engineering, Vol. 1,1989, 
pp. 453-465), J. S. Hunter illustrates some of the problems associated with 3*" p fractional 
factorial designs. Factor A is the amount of ethanol added to a standard fuel and factor 
B represents the air/fuel ratio. The response variable is carbon monoxide (CO) emission 
in g/m 3 . The design is shown below: 



Design 


Observations 

y 

A 

B 


*2 

0 

0 

-1 

-l 

66 

62 

1 

0 

0 

-l 

78 

81 

2 

0 

+ 1 

-l 

90 

94 

0 

1 

-1 

0 

72 

67 

1 

1 

0 

0 

80 

81 

2 

1 

+1 

0 

75 

78 

0 

2 

-1 

+1 

68 

66 

1 

2 

0 

+1 

66 

69 

2 

2 

+1 

+1 

60 

58 


Notice that we have used the notation system of 0,1, and 2 to represent the low, medium, 
and high levels for the factors. We have also used a “geometric notation” of -1, 0, and 
+ 1. Each run in the design is replicated twice. 

(a) Verify that the second-order model 

y= 78.5 + 4.5x, - 7.0* 2 - 4.5 x\ - 4.0*| - 9.0*!* 2 
is a reasonable model for this experiment. Sketch the CO concentration contours 
in the *,, x 2 space. 
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(b) Now suppose that instead of only two factors, we had used four factors in a 3 4-2 
fractional factorial design and obtained exactly the same data as in part (a). The 
design would be as follows: 


ABC 
0 0 0 
1 0 1 
2 0 2 
0 1 2 
1 1 0 
2 1 1 
0 2 1 
1 2 2 
2 2 0 


D _ 

0 -1 

1 0 

2 +1 

1 -1 

2 0 

0 +1 

2 -1 

0 0 

1 +1 


0 

0 

0 

+ 1 


Observations 


-1 -1 66 

0 0 78 

+ 1 +1 90 

+ 1 0 72 

-1 +1 80 

0 -1 75 

0 +1 68 

+ 1 -1 66 

-1 0 60 


62 

81 

94 

67 

81 

78 

66 

69 

58 


Confirm that this design is an L 9 orthogonal array. 

(c) Calculate the marginal averages of the CO response at each level of the four factors 
A, B, C, and D. Construct plots of these marginal averages and interpret the results. 
Do factors C and D appear to have strong effects? Do these factors really have any 
effect on CO emission? Why is their apparent effect strong? 

(d) The design in part (b) allows the model 

y = 0o + E 0* + E Puxt + e 

to be fitted. Suppose that the true model is 

y = 0o + S A*, + E Pu*t + EE frjXiXj = e 

Show that if 0, represents the least squares estimates of the coefficients in the fitted 
model, then 


£(J8 0 ) = A> - 13,3 - j3 I4 - 
£(0,) = 0, - (023 + 024)/2 
£(0 2 ) = 02 - (013 + 014 + 034)/2 
£(03) = 03 - (012 + 024)/2 
£(04) = 04 - (012 + 023)/2 
£(0l,) = 0,1 - (023 - 02 4 )/2 
£(022) = 022 + (0,3 + 0,4 + 034)/2 
£(033) = 033 - (024 - 0, 2 )/2 + 0,4 
£(044) = 044 - (0,2 - 02 3 )/2 + 0,3 

Does this help explain the strong effects for factors C and D observed graphically 
in part (c)? 

11-24. Suppose that you need to design an experiment to lit a quadratic model over the region 
-1 =s jc, =s +1, / = 1,2 subject to the constraint jc, + x 2 =£ 1. If the constraint is violated. 
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the process will not work properly. You can afford to make no more than n = 12 runs. 
Set up the following designs: 

(a) An “inscribed” CCD with center point at x x = x 2 = 0. 

(b) An “inscribed” 3 2 factorial with center point at x x = x 2 = —0.25. 

(c) A D-optimal design. 

(d) A modified D-optimal design that is identical to the one in part (c), but with all 
replicate runs at the design center. 

(e) Evaluate the | (X'X) 1 1 criterion for each design. 

(f) Evaluate the D-efficiency for each design relative to the D-optimal design in 
part (c). 

(g) Which design would you prefer? Why? 

11-25. Consider a 2 3 design for fitting a first-order model. 

(a) Evaluate the D-criterion | (X'X) -1 1 for this design. 

(b) Evaluate the A-criterion tr(X'X) -1 for this design. 

(c) Find the maximum scaled prediction variance for this design. Is this design 
G-optimal? 

11-26. Repeat Problem 11-25 using a first-order model with the two-factor interactions. 

11-27. A chemical engineer wishes to fit a calibration curve for a new procedure used to measure 
the concentration of a particular ingredient in a product manufactured in his facility. 
Twelve samples can be prepared, having known concentration. The engineer wants to 
build a model for the measured concentrations. He suspects that a linear calibration curve 
will be adequate to model the measured concentration as a function of the known con¬ 
centrations; that is, y = j3 0 + P\X + e, where x is the actual concentration. Four exper¬ 
imental designs are under consideration. Design 1 consists of six runs at known 
concentration 1 and six runs at known concentration 10. Design 2 consists of four runs 
at concentrations 1, 5.5, and 10. Design 3 consists of three runs at concentration 1, 4, 7, 
and 10. Finally, design 4 consists of three runs at concentrations 1, and 10 and six runs 
at concentration 5.5. 

(a) Plot the scaled variance of prediction for all four designs on the same graph over 
the concentration range 1 =£ x =s 10. Which design would be preferable? 

(b) For each design calculate the determinant of (X'X) -1 . Which design would be 
preferred according to the D-criterion? 

(c) Calculate the D-efficiency of each design relative to the “best” design that you 
found in part (b). 

(d) For each design, calculate the average variance of prediction over the set of points 
given by x = 1, 1.5, 2, 2.5,.... 10. Which design would you prefer according to 
the E-criterion? 

(e) Calculate the E-efficiency of each design relative to the best design that you found 
in part (d). 

(f) What is the G-efficiency of each design? 

11-28. Rework problem 11-27 assuming that the model the engineer wishes to fit is a quadratic. 

Obviously, only designs 2, 3, and 4 can now be considered. 

11-29. An experimenter wishes to run a three-component mixture experiment. The constraints 
in the component proportions are as follows: 

0.2 =s x, « 0.4 
0.1 =£ x 2 =£ 0.3 
0.4 =£ *£ 0.7 

(a) Set up an experiment to fit a quadratic mixture model. Use n = 14 runs, with four 
replicates. Use the D-criterion. 

(b) Draw the experimental region. 
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(c) Set up an experiment to fit a quadratic mixture model with n = 12 runs, assuming 
that three of these runs are replicates. Use the D-criterion. 

(d) Comment on the two designs you have found. 

11-30. Myers and Montgomery (1995) describe a gasoline blending experiment involving three 
mixture components. There are no constraints on the mixture proportions, and the fol¬ 
lowing 10-run design is used: 


Design Point 

X, 

*2 

*3 

y, mi/gal 

1 

1 

0 

0 

24.5, 25.1 

2 

0 

1 

0 

24.8, 23.9 

3 

0 

0 

1 

22.7, 23.6 

4 

i 

4 

0 

25.1 

5 

l 

0 

4 

24.3 

6 

0 

4 

1 

23.5 

7 

4 

4 

4 

24.8, 24.1 

8 

| 

4 

4 

24.2 

9 

| 

§ 

4 

23.9 

10 

I 

8 

1 

23.7 


(a) What type of design did the experimenters use? 

(b) Fit a quadratic mixture model to the data. Is this model adequate? 

(c) Plot the response surface contours. What blend would you recommend to maximize 
the miles per gallon? 

11-31. Consider the bottle-filling experiment in Example 6-1. Suppose that the percentage of 
carbonation (A) is a noise variable (in coded units of = 1). 

(a) Fit the response model to these data. Is there a robust design problem? 

(b) Find the mean model and either the variance model or the POE. 

(c) Find a set of conditions that result in mean fill deviation as close to zero as possible 
with minimum transmitted variance. 

11-32. Consider the experiment in Problem 11-12. Suppose that temperature is a noise variable 
(of = 1 in coded units). Fit response models for both responses. Is there a robust design 
problem with respect to both responses? Find a set of conditions that maximize conver¬ 
sion with activity between 55 and 60 and that minimize the variability transmitted from 
temperature. 

11-33. An experiment has been run in a process that applies a coating material to a wafer. Each 
run in the experiment produced a wafer, and the coating thickness was measured several 
times at different locations on the wafer. Then the mean y[ and standard deviation y 2 of 
the thickness measurement was obtained. The data [adapted from Box and Draper (1987)] 
are shown in the table below: 


Mean Std. Dev. 

Run Speed Pressure Distance y, y 2 


-1.000 

0.000 

1.000 

-1.000 

0.000 

1.000 

-1.000 


-1.000 

-1.000 

-1.000 

0.000 

0.000 

0.000 

1.000 


-1.000 

-1.000 

-1.000 

-1.000 

-1.000 

-1.000 

-1.000 


24.0 

120.3 
213.7 

86.0 

136.6 

340.7 

112.3 


12.5 

8.4 
42.8 

3.5 
80.4 
16.2 

27.6 
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Run 

Speed 

Pressure 

Distance 

yi 

Std. Dev. 

>2 

8 

0.000 

1.000 

-1.000 

256.3 

4.6 

9 

1.000 

1.000 

-1.000 

271.7 

23.6 

10 

-1.000 

-1.000 

0.000 

81.0 

0.0 

11 

0.000 

-1.000 

0.000 

101.7 

17.7 

12 

1.000 

-1.000 

0.000 

357.0 

32.9 

13 

-1.000 

0.000 

0.000 

171.3 

15.0 

14 

0.000 

0.000 

0.000 

372.0 

0.0 

15 

1.000 

0.000 

0.000 

501.7 

92.5 

16 

-1.000 

1.000 

0.000 

264.0 

63.5 

17 

0.000 

1.000 

0.000 

427.0 

88.6 

18 

1.000 

1.000 

0.000 

730.7 

21.1 

19 

-1.000 

-1.000 

1.000 

220.7 

133.8 

20 

0.000 

-1.000 

1.000 

239.7 

23.5 

21 

1.000 

-1.000 

1.000 

422.0 

18.5 

22 

-1.000 

0.000 

1.000 

199.0 

29.4 

23 

0.000 

0.000 

1.000 

485.3 

44.7 

24 

1.000 

0.000 

1.000 

673.7 

158.2 

25 

-1.000 

1.000 

1.000 

176.7 

55.5 

26 

0.000 

1.000 

1.000 

501.0 

138.9 

27 

1.000 

1.000 

1.000 

1010.0 

142.4 


(a) What type of design did the experimenters use? Is this a good choice of design for 
fitting a quadratic model? 

(b) Build models of both responses. 

(c) Find a set of optimum conditions that result in the mean as large as possible with 
the standard deviation less than 60. 

11-34. A variation of Example 6-2. In Example 6-2 we found that one of the process variables 
(B = pressure) was not important. Dropping this variable produces two replicates of a 
2 3 design. The data are shown below: 


C 

D 

A (+) 

A (—) 

y s 2 

- 

- 

45,48 

71,65 

57.75 

121.19 

+ 

- 

68, 80 

60,65 

68.25 

72.25 


+ 

43,45 

100, 104 

73.00 

1124.67 

■ 

+ 

75,70 

86, 96 

81.75 

134.92 


Assume that C and D are controllable factors and that A is a noise variable. 

(a) Fit a model to the mean response. 

(b) Fit a model to the ln(s 2 ) response. 

(c) Find operating conditions that result in the mean filtration rate response exceeding 
75 with minimum variance. 

(d) Compare your results with those from Example 11-6, which used the transmission 
of error approach. How similar are the two answers? 





Experiments 
with Random 
Factors 


Throughout most of this book we have assumed that the factors in an experiment were 
fixed factors, that is, the levels of the factors used by the experimenter were the specific 
levels of interest. The implication of this, of course, is that the statistical inferences made 
about these factors is confined to the specific levels studied. That is, if three material 
types are investigated as in the battery life experiment of Example 5-1, our conclusions 
are valid only about those specific material types. A variation of this occurs when the 
factor or factors are quantitative. In these situations, we often use a regression model 
relating the response to the factors to predict the response over the region spanned by 
the factor levels used in the experimental design. Several examples of this were presented 
in Chapters 5 through 9. In general, with a fixed effect, we say that the inference space 
of the experiment is the specific set of factor levels investigated. 

In some experimental situations, the factor levels are chosen at random from a larger 
population of possible levels, and the experimenter wishes to draw conclusions about the 
entire population of levels, not just those that were used in the experimental design. In 
this situation the factor is said to be a random factor. We begin with a simple situation, 
a single-factor experiment where the factor is random and use this to introduce the ran¬ 
dom effects model for the analysis of variance and components of variance. Random 
factors also occur regularly in factorial experiments as well as in other types of experi¬ 
ments. In this chapter we focus on methods for the design and analysis of factorial 
experiments with random factors. In Chapter 13 we will present nested and split-plot 
designs, two situations where random factors are frequently encountered in practice. 


12-1 THE RANDOM EFFECTS MODEL 

An experimenter is frequently interested in a factor that has a large number of possible 
levels. If the experimenter randomly selects a of these levels from the population of 
factor levels, then we say that the factor is random. Because the levels of the factor 
actually used in the experiment were chosen randomly, inferences are made about the 
entire population of factor levels. We assume that the population of factor levels is either 
of infinite size or is large enough to be considered infinite. Situations in which the pop- 
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ulation of factor levels is small enough to employ a finite population approach are not 
encountered frequently. Refer to Bennett and Franklin (1954) and Searle and Fawcett 
(1970) for a discussion of the finite population case. 

The linear statistical model is 

+ + <“-» 

where both r, and e, y are random variables. If t, has variance <x? and is independent of 
€y, the variance of any observation is 

V(y t j ) = cr 2 r + (T 2 

The variances <t 2 and a 2 are called variance components, and the model (Equation 12- 
1) is called the components of variance or random effects model. To test hypotheses 
in this model, we require that the {e,-, } are NID(0, cr 2 ), that the {r, } are NID(0, a 2 ), and 
that T t and e,y are independent. 1 
The sum of squares identity 

SS T = SS TreMments + SS E (12-2) 

is still valid. That is, we partition the total variability in the observations into a component 
that measures the variation between treatments (SS Tr eatments) and a component that mea¬ 
sures the variation within treatments (SS E ). Testing hypotheses about individual treatment 
effects is meaningless, so instead we test hypotheses about the variance component a 2 : 

Ho'.cr 2 = 0 (12-3) 

H x :a 2 T > 0 

If a 2 = 0, all treatments are identical; but if a 2 > 0, variability exists between treatments. 
As before, SS E /a 2 is distributed as chi-square with N - a degrees of freedom and, under 
the null hypothesis, SS Tieaxmen Ja- 2 is distributed as chi-square with a - 1 degrees of 
freedom. Both random variables are independent. Thus, under the null hypothesis cr? = 
0, the ratio 


^Treatments 

Q 1 _ MiSxreatments 

SS E - MS e 
N - a 


(12-4) 


is distributed as F with a — 1 and N — a degrees of freedom. However, we need to 
examine the expected mean squares to fully describe the test procedure. 

Consider 



When squaring and taking expectation of the quantities in brackets, we see that terms 
involving r? are replaced by a 2 as E{ t ; ) = 0. Also, terms involving e?, e?, and 


1 The assumption that the { tJ are independent random variables implies that the usual assumption of 2f =1 r, = 0 
from the fixed effects model does not apply to the random effects model. 
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2" =1 2" = iT? are replaced by na 2 , ana 2 , and an 2 a 2 , respectively. Furthermore, all cross- 
product terms involving r, and e, 7 have zero expectation. This leads to 

£(M.S' Tr eatments) = -~~ [Nfi? + N(t\ + Off 2 ~ N jU. 2 — HCT 2 ~ O’ 2 ] 

or 

£(MS Treatments ) = O’ 2 + 7U7 2 (12-5) 

Similarly, we may show that 

E(MS e ) = a 2 (12-6) 

From the expected mean squares, we see that under H 0 both the numerator and 
denominator of the test statistic (Equation 12-4) are unbiased estimators of a 2 , whereas 
under H x the expected value of the numerator is greater than the expected value of the 
denominator. Therefore, we should reject H 0 for values of F 0 that are too large. This 
implies an upper-tail, one-tail critical region, so we reject H 0 if F 0 

The computational procedure and analysis of the variance table for the random ef¬ 
fects model are identical to those for the fixed effects case. The conclusions, however, 
are quite different because they apply to the entire population of treatments. 

We are usually interested in estimating the variance components (cr 2 and a 2 ) in the 
model. The procedure that we use to estimate cr 2 and <r 2 is called the analysis of variance 
method because it makes use of the lines in the analysis of variance table. The procedure 
consists of equating the expected mean squares to their observed values in the analysis 
of variance table and solving for the variance components. In equating observed and 
expected mean squares in the single-factor random effects model, we obtain 

A/5 Trea .men.s = OT 2 + HCT 2 

and 

MS e = a 2 

Therefore, the estimators of the variance components are 

& 2 = MS e (12-7) 


2 = MS Treatment , - MS e 
n 

For unequal sample sizes, replace n in Equation 12-8 by 




( 12 - 8 ) 


(12-9) 


The analysis of variance method of variance component estimation does not require 
the normality assumption. It does yield estimators of or 2 and cr 2 that are best quadratic 
unbiased (i.e., of all unbiased quadratic functions of the observations, these estimators 
have minimum variance). 
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Occasionally, the analysis of variance method produces a negative estimate of a 
variance component. Clearly, variance components are by definition nonnegative, so a 
negative estimate of a variance component is viewed with some concern. One course of 
action is to accept the estimate and use it as evidence that the true value of the variance 
component is zero, assuming that sampling variation led to the negative estimate. This 
has intuitive appeal, but it suffers from some theoretical difficulties. For instance, using 
zero in place of the negative estimate can disturb the statistical properties of other esti¬ 
mates. Another alternative is to reestimate the negative variance component using a 
method that always yields nonnegative estimates. Still another alternative is to consider 
the negative estimate as evidence that the assumed linear model is incorrect and reex¬ 
amine the problem. Comprehensive treatment of variance component estimation is given 
by Searle (1971a, 1971b), Searle, Casella, and McCullogh (1992), and Burdick and Gray- 
bill (1990). 


EXAMPLE 12-1 . 

A textile company weaves a fabric on a large number of looms. It would like the looms 
to be homogeneous so that it obtains a fabric of uniform strength. The process engineer 
suspects that, in addition to the usual variation in strength within samples of fabric from 
the same loom, there may also be significant variations in strength between looms. To 
investigate this, she selects four looms at random and makes four strength determinations 
on the fabric manufactured on each loom. This experiment is run in random order, and 
the data obtained are shown in Table 12-1. The analysis of variance is conducted and is 
shown in Table 12-2 on the facing page. From the analysis of variance, we conclude that 
the looms in the plant differ significantly. 

The variance components are estimated by & 2 = 1.90 and 


&l 


29.73 - 1.90 = 6 96 
4 


Therefore, the variance of any observation on strength is estimated by 
& T = & 2 + &l= 1.90 + 6.96 = 8.86. 

Most of this variability is attributable to differences between looms. 


This example illustrates an important use of variance components—the isolating of 
different sources of variability that affect a product or system. The problem of product 


Table 12-1 Strength Data for Example 12-1 _ 

Observations 

Looms _1_2_3_4_ y^ 

1 98 97 99 96 390 

2 91 90 93 92 366 

3 96 95 97 95 383 

4 95 96 99 98 388 


1527 = y. 
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Table 12-2 Analysis of Variance for the Strength Data 


Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

F 0 

F-Value 

Looms 

89.19 

3 

29.73 

15.68 

<0.001 

Error 

22.75 

12 

1.90 



Total 

111.94 

15 





variability frequently arises in quality assurance, and it is often difficult to isolate the 
sources of variability. For example, this study may have been motivated by too much 
variability in the strength of the fabric, as illustrated in Figure 12-la. This graph displays 
the process output (fiber strength) modeled as a normal distribution with variance & y = 
8.86. (This is the estimate of the variance of any observation on strength from Example 
12-1.) Upper and lower specifications on strength are also shown in Figure 12-la, and 
it is relatively easy to see that a fairly large proportion of the process output is outside 
the specifications (the shaded tail areas in Figure 12-la). The process engineer has asked 
why so much fabric is defective and must be scrapped, reworked, or downgraded to a 
lower quality product. The answer is that most of the product strength variability is the 
result of differences between looms. Different loom performance could be the result of 
faulty setup, poor maintenance, ineffective supervision, poorly trained operators, defec¬ 
tive input fiber, and so forth. 

The process engineer must now try to isolate the specific causes of the difference in 
loom performance. If she could identify and eliminate these sources of between-loom 
variability, the variance of the process output could be reduced considerably, perhaps to 
as low as & 2 y = 1.90, the estimate of the within-loom (error) variance component in 
Example 12-1. Figure 12-16 shows a normal distribution of fiber strength with & 2 y = 
1.90. Note that the proportion of defective product in the output has been dramatically 
reduced. Although it is unlikely that all of the between-loom variability can be elimi¬ 
nated, it is clear that a significant reduction in this variance component would greatly 
increase the quality of the fiber produced. 



ls us 

(a) Variability of process output. 



(6) Variability of process output if = 0. 

Figure 12-1 Process output in the fiber strength problem. 
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We may easily find a confidence interval for the variance component a 2 . If the 
observations are normally and independently distributed, then (N — a)MS E Jcr 2 is distrib¬ 
uted as Xn-o- Thus, 

(N - a)MS E 


E Xlw-a j = 1 - a 


and a 100(1 - a) percent confidence interval for a 2 is 

(N - d)MS E < < W ~ °) MS e 

Xa/2 JJ-a X 1 —(ot/2)JV—a 

Now consider the variance component a 2 . The point estimator of cr 2 is 

,2 MS Treatments “ MS E 


The random variable ( a - l)MS Tieatmen J(a 2 + na 2 T ) is distributed as xl-u and 
(N - a)MS E /a 2 is distributed as xl-a- Thus, the probability distribution of & 2 r is a linear 
combination of two chi-square random variables, say 


n(a - 1) 


n(N - a) 


Unfortunately, a closed-form expression for the distribution of this linear combination 
of chi-square random variables cannot be obtained. Thus, an exact confidence interval 
for a 2 cannot be constructed. Approximate procedures are given in Graybill (1961) and 
Searle (1971a). Also see Section 12-7. 

It is easy to find an exact expression for a confidence interval on the ratio 
a 2 T /(al + a 2 ). This is a meaningful ratio because it reflects the proportion of the variance 
of an observation [recall that V(y, y ) = a 2 + a 2 ] that is the result of differences between 
treatments. To develop this confidence interval for the case of a balanced design, note 
that MS Treatme nts and MS E are independent random variables and, furthermore, that it can 
be shown that 

MS Tieatmen J(ncr 2 T + a 2 ) _ 

MSJct 2 a ~ UN “ 


By rearranging Equation 12-11, we may obtain the following: 


= 1 - a (12-11) 


( 12 - 12 ) 


1 (MS 


(12-13a) 
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and 



n 


/ ^Treatments _ \_ 

\ MS e F, _ ^ 


") 


(1213b) 


Note that L and JJ are 100(1— a) percent lower and upper confidence limits, respec¬ 
tively, for the ratio a 2 Ja 2 . Therefore, a 100(1 - a) percent confidence interval for 
o- 2 T /(a 2 T + a 2 ) is 


L a 2 < U 

l + L" a 2 T + a 2 " l + U 


(12-14) 


To illustrate this procedure, we find a 95 percent confidence interval on cr;/(<rj + 
a 2 ) for the strength data in Example 12-1. Recall that AfS Treatments = 29.73, MS E = 1.90, 
a = 4, n = 4, Fo.025,3,12 = 4.47, and F 0 . 975i312 = l/F 0 .o 2 s,i2,3 = 1/14.34 = 0.070. Therefore, 
from Equation 12- 13a and b, 

»-i[(S)(d5) -I-”- 

and from Equation 12-12, the 95 percent confidence interval on al/icr 2 + a 2 ) is 
0.625 < <t 2 < 54.883 

1.625 " <t 2 + a 2 ^ 55.883 


or 


0.39 =s 2 °~ t , ^ 0.98 
cr T + tr 

We conclude that variability between looms accounts for between 39 and 98 percent of 
the variance in the observed strength of the fabric produced. This confidence interval is 
relatively wide because of the small sample size that was used in the experiment. Clearly, 
however, the variability between looms ( a 2 ) is not negligible. 


12-2 THE TWO-FACTOR FACTORIAL WITH RANDOM FACTORS 


Suppose that we have two factors, A and B, and that both factors have a large number 
of levels that are of interest (as in the previous section, we will assume that the number 
of levels is infinite). We will choose at random a levels of factor A and b levels of factor 
B and arrange these factor levels in a factorial experimental design. If the experiment is 
replicated n times, we may represent the observations by the linear model 


y.jk = >i + T t + $ + (t/3)„ + e ijk 



(12-15) 


where the model parameters t,, fy, (t/?),-,, and e iJk are all independent random variables. 
We are also going to assume that the random variables t„ fij, (rj8), y , and e ijk are normally 
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distributed with mean zero and variances given by V(t,) — a 2 , H(/3 7 ) — <x|, V^[(r^) /y ] — 
o-L, and K(e iyJfc ) = <r 2 . Therefore the variance of any observation is 

V{y ijk ) = a 2 + o% + a 2 Tf) + a 2 (12-16) 

and a 2 , a%, a 2 T(3 , and cr 2 are the variance components. The hypotheses that we are 
interested in testing are H 0 :a 2 T = 0, H 0 :a% = 0, and H 0 :a 2 r p = 0. Notice the similarity 
to the single-factor random effects model. 

The numerical calculations in the analysis of variance remains unchanged; that is, 
SS A , SS b , SS ab , SS t , and SS E are all calculated as in the fixed effects case. However, to 
form the test statistics, we must examine the expected mean squares. It may be shown 
that 

E(MS a ) = a 2 + ncr 2 T p + bna 2 
E(MS b ) = a 2 + ncr 2 Tl3 + ana\ 

E(MS ab ) = a 2 + nalp (12-17) 


and 

E(MS e ) = a 2 


From the expected mean squares, we see that the appropriate statistic for testing the 
no-interaction hypothesis H 0 : a 2 p = 0 is 


F 0 


MS ab 

MS e 


(12-18) 


because under H 0 both numerator and denominator of F 0 have expectation a 2 , and only 
if H 0 is false is E(MS AB ) greater than E(MS e ). The ratio F 0 is distributed as 
F {a -i)( h -i),ab{ n -\y Similarly, for testing H 0 : a 2 = 0 we would use 


F 0 


MS a 

MS ab 


(12-19) 


which is distributed as F a _i, (a _i)(*-i), and for testing H 0 :a 2 p — 0 the statistic is 


F 0 


MS b 

MS ab 


( 12 - 20 ) 


which is distributed as F b - Ua - lxb - iy These are all upper-tail, one-tail tests. Notice that 
these test statistics are not the same as those used if both factors A and B are fixed. The 
expected mean squares are always used as a guide to test statistic construction. 

In many experiments involving random factors, interest centers at least as much on 
estimating the variance components as on hypothesis testing. The variance components 
may be estimated by the analysis of variance method, that is, by equating the observed 
mean squares in the lines of the analysis of variance table to their expected values and 
solving for the variance components. This yields 


a 2 = MS e 

- MS ab - MS e 
- 


MS b - MS At 


( 12 - 21 ) 
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as the point estimates of the variance components in the two-factor random effects model. 
We will discuss other methods for obtaining point estimates of the variance components 
and procedures for constructing confidence intervals in Section 12-7. 

EXAMPLE 12-2 . 

A Measurement Systems Capability Study 

Statistically designed experiments are frequently used to investigate the sources of vari¬ 
ability that affect a system. A common industrial application is to use a designed exper¬ 
iment to study the components of variability in a measurement system. These studies are 
often called gauge capability studies or gauge repeatability and reproducibility 
(R&R) studies, because these are the components of variability that are of interest. 

A typical gauge R&R experiment [from Montgomery (1996)] is shown in Table 
12-3. An instrument or gauge is used to measure a critical dimension on a part. Twenty 
parts have been selected from the production process, and three randomly selected op¬ 
erators measure each part twice with this gauge. The order in which the measurements 
are made is completely randomized, so this is a two-factor factorial experiment with 
design factors parts and operators, with two replications. Both parts and operators are 
random factors. The variance component identity in Equation 12-15 applies; namely, 

a 2 y = a 2 r + a 2 p + alp + a 2 

where a 2 y is the total variability (including variability due to the different parts, variability 
due to the different operators, and variability due to the gauge), a\ is the variance com¬ 
ponent for parts, a 2 p is the variance component for operators, al p is the variance com¬ 
ponent that represents interaction between parts and operators, and cr 2 is the random 


Table 12-3 The Measurement Systems Capability Experiment 
in Example 12-2 


Part 

Number 

Operator 1 

Operator 2 

Operator 3 

1 

21 

20 

20 

20 

19 

21 

2 

24 

23 

24 

24 

23 

24 

3 

20 

21 

19 

21 

20 

22 

4 

27 

27 

28 

26 

27 

28 

5 

19 

18 

19 

18 

18 

21 

6 

23 

21 

24 

21 

23 

22 

7 

22 

21 

22 

24 

22 

20 

8 

19 

17 

18 

20 

19 

18 

9 

24 

23 

25 

23 

24 

24 

10 

25 

23 

26 

25 

24 

25 

11 

21 

20 

20 

20 

21 

20 

12 

18 

19 

17 

19 

18 

19 

13 

23 

25 

25 

25 

25 

25 

14 

24 

24 

23 

25 

24 

25 

15 

29 

30 

30 

28 

31 

30 

16 

26 

26 

25 

26 

25 

27 

17 

20 

20 

19 

20 

20 

20 

18 

19 

21 

19 

19 

21 

23 

19 

25 

26 

25 

24 

25 

25 

20 

19 

19 

18 

17 

19 

17 
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experimental error. Typically, the variance component a 2 is called the gauge repeata¬ 
bility, because cr 2 can be thought of as reflecting the variation observed when the same 
part is measured by the same operator, and 

00 + 0-T0 

is usually called the reproducibility of the gauge, because it reflects the additional vari¬ 
ability in the measurement system resulting from use of the instrument by the operator. 
These experiments are usually performed with the objective of estimating the variance 
components. 

Table 12-4 (on the facing page) shows the analysis of variance for this experiment. 
The computations were performed using the Balanced ANOVA routine in Minitab. Based 
on the E-values, we conclude that the effect of parts is large, operators may have a small 
effect, and that there is no significant part-operator interaction. We may use Equation 
12-21 to estimate the variance components as follows: 


O't/B 


62.39 - 0.71 
(3)(2) 

1.31 ~ 0.71 
(20X2) 
0.71 - 0.99 
2 


= 10.28 
= 0.015 
= -0.14 


& 2 = 0.99 

The bottom portion of the Minitab output in Table 12-4 contains the expected 
mean squares for the random model with numbers in parentheses representing the vari¬ 
ance components [(4) represents a 2 , (3) represents a 2 ^, etc.]. The estimates of the va¬ 
riance components are also given, along with the error term that was used in testing that 
variance component in the analysis of variance. We will discuss the terminology unre¬ 
stricted model later; it has no relevance in random models. 

Notice that the estimate of one of the variance components, & 2 P , is negative. This is 
certainly not reasonable because by definition variances are nonnegative. Unfortunately, 
negative estimates of variance components can result when we use the analysis of vari¬ 
ance method of estimation (this is considered one of its drawbacks). There are a variety 
of ways to deal with this. One possibility is to assume that the negative estimate means 
that the variance component is really zero and just set it to zero, leaving the other non¬ 
negative estimates unchanged. Another approach is to estimate the variance components 
with a method that assures nonnegative estimates (we will discuss this briefly in Section 
12-7). Finally, we could note that the E-value for the interaction term in Table 12-4 is 
very large, take this as evidence that a 2 p really is zero, that there is no interaction effect, 
and fit a reduced model of the form 

yak = ii + t ,• + Pj + €, jk 

that does not include the interaction term. This is a relatively easy approach and one that 
often works nearly as well as more sophisticated methods. 

Table 12-5 (page 522) shows the analysis of variance for the reduced model. Because 



Table 12-4 Analysis of Variance (Minitab Balanced ANOVA) for Example 12-2 
Analysis of Variance (Balanced Designs) 
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Table 12- 5 Analysis of Variance for the Reduced Model, Example 12-2 
Analysis of Variance (Balanced Designs) 


Factor Type Levels Values 
part random 20 1 2 

8 9 

15 16 

operator random 312 
Analysis of Variance for y 


3 4 5 6 

10 11 12 13 

17 18 19 20 

3 


Source 

part 

operator 

Error 

Total 


DF SS 

19 1185.425 

2 2.617 

98 86.550 

119 1274.592 


MS F P 

62.391 70.64 0.000 

1.308 1.48 0.232 

0.883 


iource 

1 part 

2 operator 

3 Error 


Variance 

component 

10.2513 

0.0106 

0.8832 


Error Expected Mean Square for Each Tern 
term (using unrestricted model) 

3 (3) + 6(1) 

3 (3) + 40(2) 

(3) 


there is no interaction term in the model, both main effects are tested against the error 
term, and the estimates of the variance components are 


(3X2) 

1.31 - 0.88 


Finally, we could estimate the variance of the gauge as the sum of the variance component 
estimates & 2 and &% as 

d"gauge = & 2 + &l 

= 0.88 + 0.0108 
= 0.8908 


The variability in the gauge appears small relative to the variability in the product. This 
is generally a desirable situation, implying that the gauge is capable of distinguishing 
among different grades of product. 


12-3 THE TWO-FACTOR MIXED MODEL 

We now consider the situation where one of the factors A is fixed and the other B 
is random. This is called the mixed model analysis of variance. The linear statistical 
model is 

' i = 1,2,..., a 
j = 1, 2,..., b 
k= 1,2 


y, jk = n + T t + ft + ( t / 3 )ij + e ljk 


( 12 - 22 ) 
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Here r, is a fixed effect, ft is a random effect, the interaction (r/3) is assumed to be a 
random effect, and e iJk is a random error. We also assume that the {r,} are fixed effects 
such that Sf-jTi - 0 and ft is a NID(0, a%) random variable. The interaction effect, 
( t B) is a normal random variable with mean 0 and variance [(a - Dla]^, however, 
summing the interaction component over the fixed factor equals zero. That is. 


This restriction implies that certain interaction elements at different levels of the fixed 
factor are not independent. In fact, we may show (see Problem 12-25) that 

Cov[(r/3)y, (tj 8),y] - “ 1 * 1 

The covariance between (t/3), and (r ftV for j ± f is zero, and the random error ^is 
NID(0, a 2 ). Because the sum of the interaction effects over the levels of the fixed factor 
equals zero, this version of the mixed model is often called the restricted mo e . ^ 
q In this model the variance of (r/3)„ is defined as [(a - l)/fl]cr TP rather ^an^o 
simplify the expected mean squares. The assumption (rft. 7 - 0 also has an effect on the 
expected mean squares, which we may show are 


E(MS a ) = a- 2 + na-rp + - 

E(MS b ) = <r 2 + anal 
E(MS ab ) = a 2 + na 2 p 


E(MS e ) = o' 2 

Therefore, the appropriate test statistic for testing that the means of the fixed factor effects 
are equal, or H 0 - t, = 0, is 

_ MS a 
Fo ' MS ab 

for which the reference distribution is F a - Ua -m-iy For testing H 0 \a p - 0, the test 


with reference distribution F b ^ n - iy Finally, for testing the interaction hypothesis 
Ho : alp = 0, we would use 

_ MS ab 
F ° ~ MS e 

which has reference distribution F (a -i )(fo -i), aW „-i)- ^ 

In the mixed model, it is possible to estimate the fixed factor effects as 


A = % 


(12-24) 
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The variance components a\, and a 2 may be estimated using the analysis of vari¬ 
ance method. Eliminating the first equation from Equations 12-23 leaves three equations 
in three unknowns, whose solutions are 


and 

& 2 = MS e 

This general approach can be used to estimate the variance components in any mixed 
model. After eliminating the mean squares containing fixed factors, there will always be 
a set of equations remaining that can be solved for the variance components. 

In mixed models the experimenter may be interested in testing hypotheses or con¬ 
structing confidence intervals about individual treatment means for the fixed factor. In 
using such procedures, care must be exercised to use the proper standard error of the 
treatment mean. The standard error of the fixed effect treatment mean is 

[ Mean square for testing the fixed effect ~| m _ j MS AB 
Number of observations in each treatment meanj y bn 

Notice that this is just the standard error that we would use if this was a fixed effects 
model, except that MS E has been replaced by the mean square used for hypothesis testing. 


EXAMPLE 12-3 . 

The Measurement Systems Capability Experiment Revisited 

Reconsider the gauge R&R experiment described in Example 12-2. Suppose now that 
there are only three operators that use this gauge, so the operators are a fixed factor. 
However, because the parts are chosen at random, the experiment now involves a mixed 
model. 

The analysis of variance for the mixed model is shown in Table 12-6 on the facing 
page. The computations were performed using the Balanced ANOVA routine in Minitab. 
We specified that the restricted model be used in the Minitab analysis. Minitab also 
generated the expected mean squares for this model. In the Minitab output, the quantity 
Q [2] indicates a quadratic expression involving the fixed factor effect operator. That is, 
Q[ 2] = 2*= i f3j/(b - 1). The conclusions are similar to Example 12-2. The variance 
components may be estimated from Equation (12-25) as 

MS Parls — MS e 62.39 - 0.99 

0-par.s =-=- —— -= 10.23 

an (3)(2) 

AfSpartsXoperators ~ MS E _ 0.71 ~ 0.99 _ 

PartsXoperators „ 2 

& 2 = MS e = 0.99 

These results are also given in the M ini tab output. Once again, a negative estimate of 
the interaction variance component results. An appropriate course of action would be to 




of Variance (Minitab) for the Mixed Model in Example 12-3. The Restricted Model is Ass 
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fit a reduced model, as we did in Example 12-2. In the case of a mixed model with two 
factors, this leads to the same results as in Example 12-2. 


Alternate Mixed Models 

Several different versions of the mixed model have been proposed. These models differ 
from the restricted version of the mixed model discussed previously in the assumptions 
made about the random components. One of these alternate models is now briefly 
discussed. 

Consider the model 

ytjk = fi+ OLi + yj + (ay)ij + e ijk 

where the a, (i = 1,2,..., a ) are fixed effects such that 2f =1 a,- = 0 and y h (ay)g, and 
e iJk are uncorrelated random variables having zero means and variances V(yj) = cr y , 
l/[(ay) y ] = a 2 ay , and V(e ijk ) - a 2 . Note that the restriction imposed previously on the 
interaction effect is not used here; consequently, this version of the mixed model is often 
called the unrestricted mixed model. 

We can show that expected mean squares for this model are (refer to the supplemental 
text material for this chapter) 

bn ^ oft 

E(MS a ) = a 2 + ncr 2 ay + '"* ■■■ 

E(MS b ) = a 2 + ncr 2 y + antr 2 

E(MS ab ) = tr 2 + na 2 ay (12-26) 


E(MS e ) = a 2 

Comparing these expected mean squares with those in Equation 12-23, we note that the 
only obvious difference is the presence of the variance component a\ y in the expected 
mean square for the random effect. (Actually, there are other differences because of the 
different definitions of the variance of the interaction effect in the two models.) Conse¬ 
quently, we would test the hypothesis that the variance component for the random effect 
equals zero ( H 0 :tr 2 = 0) using the statistic 


as contrasted with testing H 0 :tr 2 p = 0 with F 0 = MS B /MS E in the restricted model. The 
test should be more conservative using this model because MS AB will generally be larger 
than MS e . 

The parameters in the two models are closely related. In fact, we may show that 
Pj = Jj + 

(tj 8 )ij = (ay)ij + (ay)j 
, , 1 2 
<Jy — CTJ3 H- <T ay 

and 


ait ) = a <*y 
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The analysis of variance method may be used to estimate the variance components. 
Referring to the expected mean squares, we find that the only change from Equations 
12-25 is that 


(12-27) 


Both of these models are special cases of the mixed model proposed by Scheffe 
(1956a, 1959). This model assumes that the observations may be represented by 

{ i = 1,2,..., a 
j = 1,2 ,...,b 
k = 1,2,... ,n 

where m,j and e ijk are independent random variables. The structure of m tJ is 

m,y = fx + T t + bj + Cij 
Elniij ) = ft + Ti 

2 ) r, = 0 


and 

c.j = 0 j= 1,2,..., b 

The variances and covariances of bj and c tJ are expressed through the covariances of the 
rriij. Furthermore, the random effects parameters in other formulations of the mixed model 
can be related to bj and c, 7 . The statistical analysis of Scheffe’s model is identical to that 
for our restricted model, except that in general the statistic MS A /MS AB is not always 
distributed as F when H 0 : r, = 0 is true. 

In light of this multiplicity of mixed models, a logical question is: Which model 
should one use? Most statisticians tend to prefer the restricted model, and it is the most 
widely encountered in the literature. The restricted model is actually slightly more general 
than the unrestricted model, because in the restricted model the covariance between two 
observations from the same level of the random factor can be either positive or negative, 
whereas this covariance can only be positive in the unrestricted model. If the correlative 
structure of the random components is not large, then either mixed model is appropriate, 
and there are only minor differences between these models. When we subsequently refer 
to mixed models, we assume the restricted model structure. However, if there are large 
correlations in the data, then Scheffe’s model may have to be employed. The choice of 
model should always be dictated by the data. The article by Hocking (1973) is a clear 
summary of various mixed models. 


EXAMPLE 12-4 . 

The Unrestricted Model 

Some computer software packages support only one mixed model. Minitab supports both 
the restricted and unrestricted model, although the default is to the unrestricted model. 
Table 12-7 (page 528) shows the Minitab output for the experiment in Example 12-3 
using the unrestricted model. Note that the expected mean squares are in agreement with 
those in Equation 12-26. The conclusions are identical to those from the restricted model 
analysis, and the variance component estimates are very similar. 
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12-4 SAMPLE SIZE DETERMINATION WITH RANDOM EFFECTS 

The operating characteristic curves in the Appendix may be used for sample size deter¬ 
mination in experiments with random factors. We begin with the single-factor random 
effects model of Section 12-1. The type II error probability for the random effects model 
is 

>3=1— P{ Reject H 0 \H 0 is false} 

= 1 - P{F 0 > F a , a _ 1JV _ a \<t 2 t > 0} (12-28) 

Once again, the distribution of the test statistic F 0 = MS Treatmen JMS E under the alternative 
hypothesis is needed. It can be shown that if Hi is true (cr 2 > 0), the distribution of F 0 
is central F with a - 1 and N — a degrees of freedom. 

Because the type II error probability of the random effects model is based on the 
usual central F distribution, we could use the tables of the F distribution in the Appendix 
to evaluate Equation 12-28. However, it is simpler to determine the sensitivity of the test 
through the use of operating characteristic curves. A set of these curves for various values 
of numerator degrees of freedom, denominator degrees of freedom, and a of 0.05 or 0.01 
is provided in Chart VI of the Appendix. These curves plot the probability of type II 
error against the parameter A, where 



Note that A involves two unknown parameters, a 2 and a 2 . We may be able to estimate 
cr 2 if we have an idea about how much variability in the population of treatments it is 
important to detect. An estimate of a 2 may be chosen using prior experience or judgment. 
Sometimes it is helpful to define the value of <r 2 we are interested in detecting in terms 
of the ratio cr 2 T lcr 2 . 

EXAMPLE 12-5 . 

Suppose we have five treatments selected at random with six observations per treatment 
and a = 0.05, and we wish to determine the power of the test if <r 2 is equal to a 2 . 
Because a = 5, n = 6, and a 2 = a 2 , we may compute 

A = Vl + 6(1) = 2.646 

From the operating characteristic curve with a — 1=4, IV — a = 25 degrees of freedom, 
and a = 0.05, we find that 

j8 - 0.20 

and thus the power is approximately 0.80. 


We can also use the percentage increase in the standard deviation of an observation 
method to determine sample size. If the treatments are homogeneous, then the standard 
deviation of an observation selected at random is a. However, if the treatments are 
different, the standard deviation of a randomly chosen observation is 


Vo - 2 + 
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If P is the fixed percentage increase in the standard deviation of an observation beyond 
which rejection of the null hypothesis is desired, 

Vcr 2 + <t\ 

- 1 = 1 + 0.01P 

a 

or 

^ = (1 + 0.01P) 2 - 1 
a 

Therefore, using Equation 12-29, we find that 

A = = Vl + n[( 1 + 0.01P) 2 - 1] (12-30) 

For a given P, the operating characteristic curves in Appendix Chart VI can be used to 
find the desired sample size. 

We can also use the operating characteristic curves for sample size determination 
for the two-factor random effects model and the mixed model. Appendix Chart VI is 
used for the random effects model. The parameter A, numerator degrees of freedom, and 
denominator degrees of freedom are shown in the top half of Table 12-8. For the mixed 
model, both Charts V and VI in the Appendix must be used. The appropriate values for 
<J> 2 and A are shown in the bottom half of Table 12-8. 

Table 12-8 Operating Characteristic Curve Parameters for Tables V and VI of the Appendix 

for the Two-Factor Random Effects and Mixed Models _ 


The Random Effects Model 



Numerator 

Denominator 



Degrees of 

Degrees of 

Factor 

A 

Freedom 

Freedom 

A 

A + A” 5 , 

a - 1 

(a - \)(b - 1) 


\ + a 2 + 


B h + an(T i - b - 1 (a - m - 1) 

V O 2 + na2 Tf3 


AB 


(a - l)(b - 

- 1) 

ab{n — 1) 

The Mixed Model 

Factor 

Parameter 

Numerator 
Degrees of 
Freedom 

Denominator 
Degrees of 
Freedom 

Appendix 

Chart 

A (Fixed) 

tai rf 

a - 1 

(a - 1 )(b - 1) 

V 


^ ala 2 + naif .d 




B (Random) 

/ anoj. 

b - 1 

ab(n — 1) 

VI 

AB 

A " 

{a - IX* - 1) 

ab(n — 1) 

VI 
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12-5 RULES FOR EXPECTED MEAN SQUARES 

An important part of any experimental design problem is conducting the analysis of 
variance. This involves determining the sum of squares for each component in the model 
and the number of degrees of freedom associated with each sum of squares. Then, to 
construct appropriate test statistics, the expected mean squares must be determined. In 
complex design situations, particularly those involving random or mixed models, it is 
frequently helpful to have a formal procedure for this process. 

We will present a set of rules for writing down the expected mean squares for any 
balanced factorial, nested, 2 or nested factorial experiment. (Note that partially balanced 
arrangements, such as Latin squares and incomplete block designs, are specifically ex¬ 
cluded.) These rules are discussed by several authors, including Scheffe (1959), Bennett 
and Franklin (1954), Cornfield and Tukey (1956), and Searle (1971a, 1971b). By ex¬ 
amining the expected mean squares, one may develop the appropriate statistic for testing 
hypotheses about any model parameter. The test statistic is a ratio of mean squares that 
is chosen such that the expected value of the numerator mean square differs from the 
expected value of the denominator mean square only by the variance component or the 
fixed factor in which we are interested. 

It is always possible to determine the expected mean squares in any model as we 
did in Chapter 3, that is, by the direct application of the expectation operator. This brute 
force method, as it is often called, can be very tedious. The rules that follow always 
produce the expected mean squares without resorting to the brute force approach, and, 
with practice, they become relatively simple to use. When applied to a mixed model, 
these rules produce expected mean squares that are consistent with the assumptions for 
the restricted mixed model of Section 12-3. We illustrate the rules using the two-factor 
fixed effects factorial model. 

Rule 1. The error term in the model, e# ... m , is written as e (ij _._ )m , where the subscript m 
denotes the replication subscript. For the two-factor model, this rule implies that e ijk 
becomes 


Rule 2. In addition to an overall mean (/a) and an error term [e ( y ,. )m ], the model contains 
all the main effects and any interactions that the experimenter assumes exist. If all pos¬ 
sible interactions between k factors exist, then there are ( 2 ) two-factor interactions, (3) 
three-factor interactions,..., 1 ^-factor interaction. If one of the factors in a term appears 
in parentheses, then there is no interaction between that factor and the other factors in 
that term. 


Rule 3. For each term in the model, divide the subscripts into three classes: (a) live— 
those subscripts that are present in the term and are not in parentheses; (b) dead—those 
subscripts that are present in the term and are in parentheses; and (c) absent—those 
subscripts that are present in the model but not in that particular term. 

Thus, in (t/3),,, i and j are live and k is absent, and in e (iJ)k , k is live and i and j are 
dead. 


2 Nested designs are discussed in Chapter 13. 
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Rule 4. Degrees of freedom. The number of degrees of freedom for any term in the 
model is the product of the number of levels associated with each dead subscript and the 
number of levels minus 1 associated with each live subscript. 

For example, the number of degrees of freedom associated with ( t / 3 ),, is {a - 1 ) 
(i b - 1), and the number of degrees of freedom associated with e (ijjk is ab(n - 1). 

Rule 5. Each term in the model has either a variance component (random effect) or a 
fixed factor (fixed effect) associated with it. If an interaction contains at least one random 
effect, the entire interaction is considered as random. A variance component has Greek 
letters as subscripts to identify the particular random effect. Thus, in a two-factor mixed 
model with factor A fixed and factor B random, the variance component for B is a% and 
the variance component for AB is (r 2 p. A fixed effect is always represented by the sum 
of squares of the model components associated with that factor divided by its degrees of 
freedom. In our example, the effect of A is 

|_rf 

a - 1 

Rule 6. Expected mean squares. To obtain the expected mean squares, prepare the 
following table. There is a row for each model component (mean square) and a column 
for each subscript. Over each subscript, write the number of levels of the factor associated 
with that subscript and whether the factor is fixed (F) or random (R). Replicates are 
always considered to be random. 

(a) In each row, write 1 if one of the dead subscripts in the row component matches 
the subscript in the column: 


Factor 

F 

F 

b 

j 

R 

k 

TV 




ft 




Wh 




e 0»t 

1 

1 



(b) In each row, if any of the subscripts on the row component match the subscript 
in the column, write 0 if the column is headed by a fixed factor and 1 if the 
column is headed by a random factor: 


F F 

a b 

Factor i j 

T; 0 

ft 0 

(t/% 0 0 

1 1 


R 


k 
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(c) In the remaining empty row positions, write the number of levels shown above 
the column heading: 


Factor 

F 

F 

b 

j 

R 

k 

TV 

0 

b 

n 

ft 

a 

0 

n 


0 

0 

n 


1 

1 

1 


(d) To obtain the expected mean square for any model component, first cover all 
columns headed by live subscripts on that component. Then, in each row that 
contains at least the same subscripts as those on the component being 
considered, take the product of the visible numbers and multiply by the 
appropriate fixed or random factor from rule 1. The sum of these quantities is 
the expected mean square of the model component being considered. To find 
E(MS a ), for example, cover column i. The product of the visible numbers in 
the rows that contain at least subscript i are bn (row 1), 0 (row 3), and 1 (row 
4). Note that i is missing in row 2. Therefore, the expected mean square is 

bn 2 t 2 

E(MS a ) = a 2 + — 

a - 1 

The complete table of expected mean squares for this design is shown in Table 12-9. 
Tables 12-10 and 12-11 (page 534) display the expected mean square derivations for the 
two-factor random and mixed models, respectively. Notice that the restricted version of 
the mixed model has been assumed in producing the expected mean squares. A three- 
factor factorial design is treated in the following example. 

EXAMPLE 12-6 . 

Consider a three-factor factorial experiment with a levels of factor A, b levels of factor 
B, c levels of factor C, and n replicates. The analysis of this design, assuming that all 
the factors are fixed effects, is given in Section 5-4. We now determine the expected 


Table 12-9 Expected Mean Square Derivation, Two-Factor Fixed 
Effects Model 


Factor 

F 

F 

b 

j 

R 

k 

Expected 

Mean Square 

T . 

0 

b 

n 

2 bnZrf 

ft 


0 


a - 1 
anhfij 
tr 2 + - - - 

(T0)* 

€ oy>* 

0 

1 

0 

1 

1 

b — 1 

, | /iX£(t P)l 

+ (a - m - 1) 

o 2 
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Table 12-10 Expected Mean Square Derivation, Two-Factor Random 
Effects Model 


Factor 

R 

R 

b 

j 

R 

k 

Expected 

Mean Square 

Ti 

1 

b 

n 

a 2 + no^p + bnoi 

Pj 

a 

1 

n 

a 2 + na^p + anop 

mu 

1 

1 

n 

a 2 + noip 


1 

1 

1 

a 2 


Table 12-11 

Expected Mean Squa 
Mixed Model 

ire Derivation, Two-Factor 


F 

R 

R 



a 

b 

n 

Expected 

Factor 

i 

j 

k 

Mean Square 






T/ 

0 

b 

n 

a 2 + noip + -- j- 

Pi 

a 

1 

n 

cr 2 + anal 

mu 

0 

1 

n 

a 2 + na 2 p 


1 

1 

1 

cr 2 


Table 12-12 Expected Mean Square Derivation, Three-Factor Random Effects Model 


Factor 

R 

R 

b 

j 

R 

k 

R 

l 

Expected Mean Squares 

T, 

1 

b 

c 

n 

a 2 + cnalp + bnaly + na 2 r p y + bcna 2 

Pi 

a 

1 

c 

n 

a 2 + cnalp + ana\ y + ncr 2 p y + aerial 

Jk 

a 

b 

1 

n 

a 2 + bnaXy + ana 2 p y + na 2 rPy + abna 2 y 

m,j 

1 

1 

c 

n 

a 2 + noipy + cna 2 rP 

(ry)ik 

1 

b 

1 

n 

a 2 + n&lp y + bnal y 

(pyb 

a 

1 

1 

n 

a 2 + na 2 r p y + ano% y 

<jPi)ijk 

1 

1 

1 

n 

a 2 + nalpy 


1 

1 

1 

1 

a 2 
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mean squares assuming that all the factors are random. The appropriate statistical model 
is 


y ijk i = ix + Ti + pj + y k + (Tp),j + (ry),* + (Py) ]k + (Tpy) ijk + e iJkl 


Using the rules previously described, the expected mean squares are derived in Table 
12-12 on the facing page. 

We notice, by examining the expected mean squares in Table 12-12, that if A, B, 
and C are all random factors, then no exact test exists for the main effects. That is, if we 
wish to test the hypothesis a 2 = 0, we cannot form a ratio of two expected mean squares 
such that the only term in the numerator that is not in the denominator is bcncr 2 . The 
same phenomenon occurs for the main effects of B and C. Notice that proper tests do 
exist for the two- and three-factor interactions. However, it is likely that tests on the main 
effects are of central importance to the experimenter. Therefore, how should the main 
effects be tested? This problem is considered in the next section. 


12-6 APPROXIMATE F TESTS 

In factorial experiments with three or more factors involving a random or mixed model 
and certain other, more complex designs, there are frequently no exact test statistics for 
certain effects in the models. One possible solution to this dilemma is to assume that 
certain interactions are negligible. To illustrate, if we could reasonably assume that all 
the two-factor interactions in Example 12-6 are negligible, then we could put alp = 
<j 2 ry = <j 2 p y - 0, and tests for main effects could be conducted. 

Although this seems to be an attractive possibility, we must point out that there must 
be something in the nature of the process—or some strong prior knowledge—for us to 
assume that one or more of the interactions are negligible. In general, this assumption is 
not easily made, nor should it be taken lightly. We should not eliminate certain inter¬ 
actions from the model without conclusive evidence that it is appropriate to do so. A 
procedure advocated by some experimenters is to test the interactions first, then set at 
zero those interactions found to be insignificant, and then assume that these interactions 
are zero when testing other effects in the same experiment. Although sometimes done in 
practice, this procedure can be dangerous because any decision regarding an interaction 
is subject to both type I and type II errors. 

A variation of this idea is to pool certain mean squares in the analysis of variance 
to obtain an estimate of error with more degrees of freedom. For instance, suppose that in 
Example 12-6 the test statistic F 0 = MS ABC /MS E was not significant. Thus, H 0 : a 2 p y = 0 
is not rejected, and both MS ABC and MS E estimate the error variance a 2 . The experimenter 
might consider pooling or combining MS ABC and MS E according to 

= abc(n - 1 )MS e + (a - \)(b - l)(c - l)MS ABC 
abdji - 1) + (a — 1 )(b — l)(c - 1) 

so that E(MS e ') - a 2 . Note that MS E - has abc(n — 1) + (a — 1 )(b- l)(c - 1) degrees 
of freedom, compared to abc(n — 1) degrees of freedom for the original MS E . 

The danger of pooling is that one may make a type II error and combine the mean 
square for a factor that really is significant with error, thus obtaining a new residual mean 
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square (MS/_) that is too large. This will make other significant effects more difficult to 
detect. On the other hand, if the original error mean square has a very small number of 
degrees of freedom (e.g., less than six), the experimenter may have much to gain by 
pooling because it could potentially increase the precision of further tests considerably. 
A reasonably practical procedure is as follows. If the original error mean square has six 
or more degrees of freedom, do not pool. If the original error mean square has fewer 
than six degrees of freedom, pool only if the F statistic for the mean square to be pooled 
is not significant at a large value of a, such as a = 0.25. 

If we cannot assume that certain interactions are negligible and we still need to make 
inferences about those effects for which exact tests do not exist, a procedure attributed 
to Satterthwaite (1946) can be employed. Satterthwaite’s method uses linear combina¬ 
tions of mean squares, for example, 

MS' = MS r + • • • + MS s (12-31) 


and 


MS" = MS U + • ■ ■ + MS V 


(12-32) 


where the mean squares in Equations 12-31 and 12-32 are chosen so that E(MS’) - 
E(MS") is equal to a multiple of the effect (the model parameter or variance component) 
considered in the null hypothesis. Then the test statistic would be 


F 


MS' 

MS" 


(12-33) 


which is distributed approximately as F p<q , where 


(MS r + • • • + MSf 
MS 2 r /f r + • • • + MS 2 /f s 


(12-34) 


and 


(MS U + • • • + MS v f 
MSl/f u + ■ ■ ■ + MS 2 Jf v 


(12-35) 


In p and q,fi is the number of degrees of freedom associated with the mean square MS,. 
There is no assurance that p and q will be integers, so it may be necessary to interpolate 
in the tables of the F distribution. For example, in the three-factor random effects model 
(Table 12-12), it is relatively easy to see that an appropriate test statistic for FI Q :a 2 T = 0 
would be F = MS'/MS", with 


MS' = MS a + MS abc 


and 


MS" = MS ab + MS ac 

The degrees of freedom for F would be computed from Equations 12-34 and 12-35. 

The theory underlying this test is that both the numerator and the denominator of 
the test statistic (Equation 12-33) are distributed approximately as multiples of chi-square 
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random variables, and because no mean square appears in both the numerator or denom¬ 
inator of Equation 12-33, the numerator and denominator are independent. Thus F in 
Equation 12-33 is distributed approximately as F pq . Satterthwaite remarks that caution 
should be used in applying the procedure when some of the mean squares in MS' and 
MS" are involved negatively. Gaylor and Hopper (1969) report that if MS' = MS, — 
MS 2 , then Satterthwaite’s approximation holds reasonably well if 


MSi 

MS 2 


> Fo.025,f 2 .f, 


X Fo.50,f 2 ,f 2 


and if *£ 100 and / 2 3= fJ2. 


EXAMPLE 12-7 . 

The pressure drop measured across an expansion value in a turbine is being studied. The 
design engineer considers the important variables that influence pressure drop reading to 
be gas temperature on the inlet side (A), operator (B), and the specific pressure gauge 
used by the operator (C). These three factors are arranged in a factorial design, with gas 
temperature fixed, and operator and pressure gauge random. The coded data for two 
replicates are shown in Table 12-13. 7116 linear model for this design is 

y.jki = /A + T,- + Pj + y k + ( Tp)ij + (ry),* + (j 8y) Jk + (rj8y) (>vt + e iJkl 

where r, is the effect of the gas temperature (A), jS, is the operator effect (B), and y k is 
the effect of the pressure gauge (C). 

The analysis of variance is shown in Table 12-14. A column entitled “Expected 
Mean Squares” has been added to this table, and the entries in this column are derived 
by the methods discussed in Section 12-5. From the Expected Mean Squares column, 
we observe that exact tests exist for all effects except the main effect A. Results for these 
tests are shown in Table 12-14 on the facing page. To test the gas temperature effect, or 
H 0 : r, = 0, we could use the statistic 


F 


MS’ 

MS" 


Table 12-13 Coded Pressure Drop Data for the Turbine Experiment 


Pressure 

(C) 





Gas Temperature (A) 





60°F 



75°F 



90°F 


Operator (B) 



Operator (B) 



Operator (5) 

1 

2 

3 

4 

1 

2 

3 

4 

1 

2 

3 4 

1 

-2 

0 

-1 

4 

14 

6 

1 

-7 

-8 

-2 

-1 -2 


-3 

-9 

-8 

4 

14 

0 

2 

6 

-8 

20 

-2 1 

2 

-6 

-5 

-8 

-3 

22 

8 

6 

-5 

-8 

1 

-9 -8 


4 

-1 

-2 

-7 

24 

6 

2 

2 

3 

-7 

-8 3 

3 

-1 

-4 

0 

-2 

20 

2 

3 

-5 

-2 

-1 

-4 1 


-2 

-8 

-7 

4 

16 

0 

0 

-1 

-1 

-2 

-7 3 
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where 


MS' = MS a + MS abc 
and 

MS" = MS ab + MS ac 

because 


E(MS') - E(MS") = 


To determine the test statistic for H 0 : t■ = 0, we compute 

MS' = MS a + MS abc 

= 511.68 + 13.84 = 525.52 
MS" = MS ab +MS ac 

= 202.00 + 34.47 = 236.47 


and 


F 


MS' 

MS" 


525,52 _ 
236.47 = 


The degrees of freedom for this statistic are found from Equations 12-34 and 12-35 as 
follows: 


and 


(MS a + MS ABC ) 2 
MSl/2 + MS\ bc I\2 
(525.52) 2 

(511.68) 2 /2 + (13.84) 2 /12 


= 2.11 « 2 


(MS ab + MS ac ) 2 
MSls/6 + MS 2 ac /4 
(236.47) 2 

(202.00) 2 /6 + (34.47) 2 /4 


Comparing F = 2.22 to F 0 . 05 , 2,8 = 4.46, we ca nn ot reject H 0 . The P-value is approxi¬ 
mately P = 0.17. 

The AB, or temperature-operator, interaction is large, and there is some indication 
of an AC, or temperature-gauge, interaction. The graphical analysis of the AB and AC 
interactions, shown in Figure 12-2 (page 540), indicates that the effect of temperature 
may be large when operator 1 and gauge 3 are used. Thus, it seems possible that the 
main effects of temperature and operator are masked by the large AB interaction. 


Table 12-15 (page 541) presents the Minitab Balanced ANOVA output for the ex¬ 
periment in Example 12-7. We have specified the restricted model. Q[ 1] represents the 
fixed effect of gas pressure. Notice that the entries in the analysis of variance table are 
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Figure 12-2 Interactions in pressure drop experiment. 


in general agreement with those in Table 12-14, except for the F test on gas temperature 
(factor A). Minitab notes that the test is not an exact test (which we see from the expected 
mean squares). The Synthesized Test constructed by Minitab is actually Satterthwaite’s 
procedure, but it uses a different test statistic than we did. Note that, from the Minitab 
output, the error mean square for testing factor A is 

(4) + (5) - (7) = MS ab + MS ac ~ MSabc 
for which the expected value is 

E[( 4) + (5) - (7)] = a 2 + na% y + cna-% + a 2 + na% y 
+ bnaly - (a 2 + ncr 2 T p y ) 

= a 2 + ncrlpy + cna 2 rP + bna 2 y 

which is an appropriate error mean square for testing the mean effect of A. This nicely 
illustrates that there can be more than one way to construct the synthetic mean squares 
used in Satterthwaite’s procedure. However, we would generally prefer the linear com¬ 
bination of mean squares we selected instead of the one chosen by Minitab because it 
does not have any mean squares involved negatively in the linear combinations. 

The analysis of Example 12-7, assuming the unrestricted model, is presented in Table 
12-16 (page 542). The principal difference from the restricted model is that now the 
expected values of the mean squares for all three mean effects are such that no exact test 
exists. In the restricted model, the two random mean effects could be tested against their 
interaction, but now the expected mean square for B involves a% y and a 2 p , and the 
expected mean square for C involves a% y and cr 2 Tr Once again, Minitab constructs 
synthetic mean squares and tests these effects with Satterthwaite s procedure. The overall 
conclusions are not radically different from the restricted model analysis, other than the 
large change in the estimate of the operator variance component. The unrestricted model 
produces a negative estimate of a 2 p. Because the gauge factor is not significant in either 
analysis, it is possible that some model reduction is in order. 



Analysis of Variance for Drop 
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Error Terms for Synthesized Tests 










Analysis of Variance (Balanced Designs) 



542 


Error Terms for Synthesized Tests 
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12-7 SOME ADDITIONAL TOPICS ON ESTIMATION 
OF VARIANCE COMPONENTS 

As we have previously observed, estimating the variance components in a random or 
mixed model is frequently a subject of considerable importance to the experimenter. 
In this section, we present some further results and techniques useful in estimating vari¬ 
ance components. We concentrate on procedures for finding confidence intervals on 
variance components, and we also illustrate how to find maximum likelihood estimates 
of variance components. The maximum likelihood method may be a useful alternative 
when the analysis of variance method produces negative estimates. 


12-7.1 Approximate Confidence Intervals on Variance Components 

When the random effects model was introduced in Section 12-1, we presented exact 
100(1 — a) percent confidence intervals for a 2 and for other functions of the variance 
components in that simple experimental design. It is always possible to find an exact 
confidence interval on any function of the variance components that is the expected value 
of one of the mean squares in the analysis of variance. For example, consider the error 
mean square. Because E(MS e ) = cr 2 , we can always find an exact confidence interval 
on cr 2 because the quantity 

f E MS E /<r 2 = f E & 2 /a 2 

has a chi-square distribution with f E degrees of freedom. The exact 100(1 - a) percent 
confidence interval is 


f E MS E 2 ^ f E MS E 

— - (T ^ j- 

Xa/2,f E Xl~oc/2,f E 


(12-36) 


Unfortunately, in more complex experiments involving several design factors, it is 
generally not possible to find exact confidence intervals on the variance components of 
interest, because these variances are not the expected value of a single mean square from 
the analysis of variance. However, the concepts underlying Satterthwaite’s approximate 
“pseudo” F tests, introduced in Section 12-6, can be employed to construct approximate 
confidence intervals on variance components for which no exact confidence interval is 
available. 

Recall that Satterthwaite’s method uses two linear combinations of mean squares 


MS' = MS r + • • • + MS s 


and 


with the test statistic 


MS" = MS U + ■ ■ ■ + MS V 


F 


MS' 

MS" 


having an approximate F distribution. Using appropriate degrees of freedom for MS' and 
MS", defined in Equations 12-34 and 12-35, this F statistic can be used in an approximate 
test of the significance of the parameter or variance component of interest. 
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For testing the significance of a variance component, say al, the two linear combi¬ 
nations, MS' and MS", are chosen such that the difference in their expected values is 
equal to a multiple of the component, say 

E(MS') - E(MS") = kal 


, E(MS’) - E(MS") 

* 0 = - J -• 

Equation 12-37 provides a basis for a point estimate of <r§: 
MS' - MS" 


&l = - 


• + - MS s - - MS U - • 
k k 


- MS V 


(12-37) 


(12-38) 


The mean squares (MS,) in Equation 12-38 are independent with/MS, /of = SSJa 2 
having chi-square distributions with/ degrees of freedom. The estimate of the variance 
component, &l, is a linear combination of multiples of the mean squares, and r&ll(rl 
has an approximate chi-square distribution with r degrees of freedom, where 


(Hr 


V 1 MS ‘ 

hk 2 / 

_ (MS r + ■■■ 


h MS s - MS U ■ 


- MS v f 


MS 2 

fr 


l MSI 

" fs 


L ^Jt 
L fu 


l MS| 

L fv 


(12-39) 


This result can only be used if &l > 0. As r will usually not be an integer, interpolation 
from the chi-square tables will generally be required. Graybill (1961) derives a general 
result for r. 

Now because r&l/a 2 has an approximate chi-square distribution with r degrees of 
freedom, 


and 


P *«/2,r] - 1 

r&l I = ] 

Yl-a/2,rj 


o f r& 0 2 c T& 0 

P I —5 - (To — 

l/V'a/2,r Xl- . 

Therefore an approximate 100(1 - a) percent confidence interval on a\ is 

r &0 2 _ r&l 

—— ^ <r 0 ^ 2 

Xa/2,r Xl-<*/2„ 


(12-40) 


EXAMPLE 12-8 . 

To illustrate this procedure, reconsider the experiment in Example 12-7, where a three- 
factor mixed model is used on a study of the pressure drop across an expansion valve of 
a turbine. The model is 

y m = /a + t, + Pj + y k + (tP),j + (ry) ik + (Py) jk + (rPy), ]k + e ijkl 
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where r, is a fixed effect and all other effects are random. We will find an approximate 
confidence interval on cr 2 ^. Using the expected mean squares in Table 12-14, we note 
that the difference in the expected values of the mean squares for the two-way interaction 
effect AS and the three-way interaction effect ABC is a multiple of the variance com¬ 
ponent of interest, al p \ 


E(MS ab ) - E(MS abc ) = cr 2 + nal Py + cna 2 rP - (cr 2 + nal Py ) 


Therefore, the point estimate of al p is 


„ 2 MS ab - MS 134.91 - 19.26 _ 

-«-= “ (3X2) = 19 - 28 


r = _ (MSab - MS abc f _ = (134.91 - 19.26) 2 

MS 2 ab t MSIbc (134.91) 2 (19.26) 2 

{a - 1 )(b - 1) (a - l)(b - 1 )(c - 1) (2)(3) (2)(3)(2) 

The approximate 95 percent confidence interval on al p is then found from Equation 12- 
40 as follows: 



(4.36)(19.28) 2 (4.36)(19.28) 

11.58 " ^ ^ Oil 


7.26 =S a 2 T p ^ 137.81 

This result is consistent with the results of the exact F test on alp, in that there is strong 
evidence that this variance component is not zero. 


12-7.2 The Modified Large-Sample Method 

The Satterthwaite method in the previous section is a relatively simple way to find an 
approximate confidence interval on a variance component that can be expressed as a 
linear combination of mean squares, say 

Q 

do = 2 c MSi (12-41) 

The Satterthwaite method works well when the degrees of freedom on each mean square 
MSj are all relatively large, and when the constants c, in Equation 12-41 are all positive. 
However, sometimes some of the c, are negative. Graybill and Wang (1980) proposed a 
procedure called the modified large-sample method, which can be a very useful alter¬ 
native to Satterthwaite’s method. If all of the constants c, in Equation 12-41 are positive, 
then the approximate 100(1 - a) percent modified large-sample confidence interval on 


GjcfMSj ^ (To 5 


fZ HfcfMSj 
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where 


G, = 1 - T. - and H, = J— - 1 

Note that an F random variable with an infinite number of denominator degrees of free¬ 
dom is equivalent to a chi-square random variable divided by its degrees of freedom. 

Now consider the more general case of Equation 12-41, where the constants c, are 
unrestricted in sign. This may be written as 

&l = 2 CiMS, ~ 2 1 CjMSj, c h Cj 3* 0 (12-43) 

Ting et al. (1990) give an approximate 100(1 - a) percent lower confidence limit on 
o-gas 

L = &o ~ W L (12-44) 


where 


v, = y GjcjMSj + 2 H J c j MS l + i S GjjCjCjMSiMSj 

,= 1 j=P +1 i=l j~P +1 

P-1 P 

+ 2 2 G%c iCt MSiMS t 

Wl.- 

1 


Hj 


Fi-a, 




G _ - l) 2 - - H J 


17, 1 

Gift 

Gif'] 

|_V Fajrfj fif 

ft 

fi J 


if P > 1 and G*, = 0 if P = 1 

These results can also be extended to include approximate confidence intervals on 
ratios of variance components. For a complete account of these methods, refer to the 
excellent book by Burdick and Graybill (1992). 


EXAMPLE 12-9 ... 

To illustrate the modified large-sample method, reconsider the three-factor mixed model 
in Example 12-7. We will find an approximate 95 percent lower confidence interval on 
a 2 T p. Recall that the point estimate of alp is 


al a = 


MS ab - MS Al 


134.91 - 19.26 

(3X2) 


= 19.28 
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Therefore, in the notation of Equation 12-43, c, = c 2 = g, and 


Gj 

H 2 

g 12 


^0.05,6,= 

1 


1 


(^0.05,6,12 l) 2 (f7i) 2 Fo.Q5,6, 1 2 (H 2 ) 2 

^0.05,6,12 

(3.00 - l) 2 - (0.524) 2 (3.00) 2 - (1.3) 2 
3.00 


-0.054 


G* = 0 


From Equation 12-44 

= GlfiMSls + H\c\MS\ BC + Gi 2 C\C 2 M.S ab MS A bc 
= (0.524) 2 ( l/6) 2 ( 134.91) 2 + (1.3) 2 (1/6) 2 (19.26) 2 
+ (—0.054)( l/6)( l/6)( 134.91)(19.26) 

= 152.36 

So an approximate 95 percent lower confidence limit on <r 2 T p is 

L = &%- W L = 19.28 - VI52J6 = 6.94 
This result is consistent with the results of the exact F test for this effect. 


12-7.3 Maximum Likelihood Estimation of Variance Components 

This chapter has emphasized the analysis of variance method of variance component 
estimation because it is relatively straightforward and makes use of familiar quantities— 
the mean squares in the analysis of variance table. However, the method has some dis¬ 
advantages, including an embarrassing tendency to produce negative estimates on oc¬ 
casion. Furthermore, the analysis of variance method is really a method of moments 
estimator, a technique that mathematical statisticians generally do not prefer to use for 
parameter estimation because it often results in parameter estimates that do not have 
good statistical properties. The preferred parameter estimation technique is called the 
method of maximum likelihood. The implementation of this method can be somewhat 
involved, particularly for an experimental design model, but in a sense, the method of 
maximum likelihood selects parameter estimates that, for a specified model and error 
distribution, maximize the probability of occurrence of the sample results. A very nice 
overview of the method of maximum likelihood applied to experimental design models 
is given by Milliken and Johnson (1984). 

A complete presentation of the method of maximum likelihood is beyond the scope 
of this book, but the general idea can be illustrated very easily. Suppose that x is a random 
variable with probability distribution f(x; 0), where 0 is an unknown parameter. Let x u 
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. . X n be a random sample of n observations. Then the likelihood function of the 

sample is 

L(ff) = f(xi, 9) • f(x 2 \ 9) . f(x n ; 6) 

Note that the likelihood function is now a function of only the unknown parameter ft 
The maximum likelihood estimator of 6 is the value of 9 that maximizes the likelihood 
function L{6). 

To illustrate how this applies to an experimental design model with random ettects, 
consider a two-factor model with a = b = n = 2. The model is 
y iJk = p, + t, + (5j + (r0)y + tijk 

with i = 1, 2, j = 1, 2, and k = 1, 2. The variance of any observation is 
V(y iJk ) = a ? = <r 2 T + o-js + + v 2 

and the covariances are 

Co v(y ijk , yrj’ k ') = <t 2 t + vl + crip i = i',j = j',k± k' 

= <rl i = j * J" 

= a 2 p i ^ i\j = / 

= 0 (12-45) 

It is convenient to think of the observations as an 8 X 1 vector, say 


'ha' 

yu2 

yin 



ym 

y 22 i 

ym 


and the variances and covariances can be expressed as an 8 X 8 covariance matrix 




o-p op 0 0 

= o 2 p o 2 p 0 0 

12 0 0 o 2 p o 2 p 

_0 0 o 2 p o 2 p_ 
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and X 2 i is just the transpose of Xi 2 - Now each observation is normally distributed with 
variance a], and if we assume that all N = abn observations have a joint normal distri¬ 
bution, then the likelihood function for the random model becomes 

L(/r, a 2 T , a 2 p , alp, a 2 ) = 1/2 ex P (y ~ jv/*)'X _1 (y - j*/z) 

where j N is an N X 1 vector of Is. The maximum likelihood estimates of fi, 
al, a% alp, and a 2 are those values of these parameters that maximize the likelihood 
function. It would also be desirable to restrict the variance component estimates to non¬ 
negative values. So in practice, we would maximize the likelihood function subject to 
this constraint. 

Estimating variance components by maximum likelihood requires specialized com¬ 
puter software. Some general statistical software packages have this capability. The SAS 
system computes maximum likelihood estimates of variance components in random or 
mixed models through SAS PROC MIXED. We will illustrate the use of PROC MIXED 
by applying it to the two-factor factorial model introduced in Examples 12-2 and 12-3. 

First consider Example 12-2. The model is a two-factor factorial, random effects 
model. The analysis of variance method has produced a negative estimate of the inter¬ 
action variance component. Negative estimates of variance components can be avoided 
in PROC MIXED by specifying that a restricted (or residual) maximum likelihood 
(REML) method be used. REML essentially constrains the variance component estimates 
to nonnegative values. 

SAS PROC MIXED requires as input the covariance matrix of the model parameters. 
The structure for a random model where all the random variables are mutually indepen¬ 
dent is 


all 0 0 ' 

0 oft. 0 

. 0 0 alpl. 


(12-46) 


where the I’s are identity matrices. (The covariance structure of a model can be specified 
in PROC MIXED by the TYPE = structure option in the RANDOM statement. The 
covariance structure for the model in Example 12-2 is specified by TYPE = SIM (the 
default in PROC MIXED), which specifies the simple covariance structure for the model 
parameters given in Equation 12-46. 

Table 12-17 (page 550) presents the output from SAS PROC MIXED for the ex¬ 
periment in Example 12-2. We specified the REML method of variance component es¬ 
timation. The output has been annotated with numbers to facilitate the description shown 
below: 


1. Variance component estimates and related output. 

2. Covariance parameter. This identifies the model parameters: al, a 2 p , alp, 
and a 2 . 

3. Ratio of estimated variance of effect to estimated residual error variance: 

af/a 2 

4. Parameter estimates. These are the REML estimates of the variance compo¬ 
nents al, a 2 p, alp, and a 2 . Notice that the REML estimate of alp is zero. 



Table 12-17 SAS PROC MIXED Output for Analysis of Gauge Repeatability and Reproducibility Study in Example 12-2 Using REML Esti 
Variance Components_______ 



550 


Standard Deviati 



Table 12-18 SAS PROC MIXED Output for Analysis of Gauge Repeatability and Reproducibility Study with the Operator as a Fixed Effect Using REML 
Estimation of Variance Components 



551 


DDF Type III 
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5. Standard error of estimate. Th is is t he large-sample standard error of the 
parameter estimate; se(&j) = VV(df). 

6. Z statistic associated with the estimated variance: 

Z = &j/se (&f). 

7. P-value of the computed Z statistic. 

8 . Alpha level used to compute confidence interval. 

9. Lower and upper bounds of a 100(1 - a) percent large-sample normal theory 
confidence interval on the variance components: 

L = of - Z a/2 se(d f) U = + Z a/2 se(&J) 

10. Asymptotic covariance matrix of estimates. This is the large-sample covariance 
matrix of the variance components estimates. 

11. Model fitting measures for comparing the fit of alternative models. 

Notice that the results from SAS PROC MIXED fairly closely match those reported in 
Example 12-2 when the reduced model (without the interaction term) was fit to the data. 

In Example 12-3 we considered the same experiment but assumed that the operators 
were a fixed factor, leading to a mixed model. SAS PROC MIXED can be employed to 
estimate the variance components for this situation. The covariance structure for the 
observations, assuming that all random variables are mutually independent (that is, the 
unrestricted mixed model), is 

Cov(y ijk , yr/ic') = Op + alp + <r 2 i = j = /> k = k 

= + a*# i = i', j = /> * * k ' (12-47) 

= (j\ i±i',j=j' 

= 0 j + j' 

The covariance matrix of the model parameters is 

G = H 1 ° 1 ( 1248) 

L o o-yj 

(This is specified by the TYPE=SIM statement in the input to PROC MIXED.) The SAS 
PROC MIXED output for the unrestricted form of the mixed model of Example 12-3 is 
shown in Table 12-18 on the previous page. Once again, the REML method was selected. 
The estimate for the variance component for the factor “part is very similar to the 
estimate obtained using the random model. The estimate for the residual error variance 
is also similar. In addition, the output includes an F test for the fixed effect. 

12-8 PROBLEMS - 

12-1. A textile mill has a large number of looms. Each loom is supposed to provide the same 
output of cloth per minute. To investigate this assumption, five looms are chosen at 
random and their output is noted at different times. The following data are obtamed: 


Loom 



Output (lb/min) 




14.0 

14.1 

14.2 

14.0 

14.1 

2 

13.9 

13.8 

13.9 

14.0 

14.0 

3 

14.1 

14.2 

14.1 

14.0 

13.9 

4 

13.6 

13.8 

14.0 

13.9 

13.7 

5 

13.8 

13.6 

13.9 

13.8 

14.0 
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(a) Explain why this is a random effects experiment. Are the looms equal in output? 
Use a = 0.05. 

(b) Estimate the variability between looms. 

(c) Estimate the experimental error variance. 

(d) Find a 95 percent confidence interval for aV(ai + a 2 ). 

(e) Analyze the residuals from this experiment. Do you think that the analysis of vari¬ 
ance assumptions are satisfied? 

12-2. A manufacturer suspects that the batches of raw material furnished by his supplier differ 
significantly in calcium content. There are a large number of batches currently in the 
warehouse. Five of these are randomly selected for study. A chemist makes five deter¬ 
minations on each batch and obtains the following data: 


Batch 1 Batch 2 Batch 3 Batch 4 Batch 5 


23.46 23.59 

23.48 23.46 

23.56 23.42 

23.39 23.49 

23.40 23.50 


23.51 23.28 

23.64 23.40 

23.46 23.37 

23.52 23.46 

23.49 23.39 


23.29 

23.46 

23.37 
23.32 

23.38 


(a) Is there significant variation in calcium content from batch to batch? Use a = 0.05. 

(b) Estimate the components of variance. 

(c) Find a 95 percent confidence interval for + a 2 ). 

(d) Analyze the residuals from this experiment. Are the analysis of variance assump¬ 
tions satisfied? 

12-3. Several ovens in a metal working shop are used to heat metal specimens. All the ovens 
are supposed to operate at the same temperature, although it is suspected that this may 
not be true. Three ovens are selected at random and their temperatures on successive 
heats are noted. The data collected are as follows: 


Oven 



Temperature 



1 

491.50 

498.30 

498.10 

493.50 

493.60 


2 

488.50 

484.65 

479.90 

477.35 



3 

490.10 

484.80 

488.25 

473.00 

471.85 

478.65 


(a) Is there significant variation in temperature between ovens? Use a = 0.05. 

(b) Estimate the components of variance for this model. 

(c) Analyze the residuals from this experiment and draw conclusions about model 
adequacy. 

12-4. An article in the Journal of the Electrochemical Society (Vol. 139, No. 2,1992, pp. 524- 
532) describes an experiment to investigate the low-pressure vapor deposition of poly¬ 
silicon. The experiment was carried out in a large-capacity reactor at Sematech in Austin, 
Texas. The reactor has several wafer positions, and four of these positions are selected 
at random. The response variable is film thickness uniformity. Three replicates of the 
experiment were run, and the data are as follows: 


Uniformity 

2.76 5.67 4.49 

1.43 1.70 2.19 

2.34 1.97 1.47 

0.94 1.36 1.65 


Wafer Position 
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(a) Is there a difference in the wafer positions? Use a = 0.05. 

(b) Estimate the variability due to wafer positions. 

(c) Estimate the random error component. 

(d) Analyze the residuals from this experiment and comment on model adequacy. 
Consider the vapor-deposition experiment described in Problem 12-4. 

(a) Estimate the total variability in the uniformity response. 

(b) How much of the total variability in the uniformity response is due to the difference 

between positions in the reactor? , 

(c) To what level could the variability in the uniformity response be reduced if the 
position-to-position variability in the reactor could be eliminated? Do you believe 
this is a significant reduction? 

An article in the Journal of Quality Technology (Vol. 13, No. 2, 1981, pp. Ill 114) 
describes an experiment that investigates the effects of four bleaching chemicals on pulp 
brightness. These four chemicals were selected at random from a large population ol 
potential bleaching agents. The data are as follows: 

Htomcal Pulp Brightness^ 

1 77.199 74.466 92.746 

2 80.522 79.306 81.914 

3 79.417 78.017 91.596 

4 78.001 78.358 77.544 


(a) Is there a difference in the chemical types? Use a = 0.05. 

(b) Estimate the variability due to chemical types. 

(c) Estimate the variability due to random error. 

(d) Analyze the residuals from this experiment and comment on model adequacy. 

12-7. Consider the one-way, balanced, random effects method. Develop a procedure for finding 

a 100(1 - a) percent confidence interval for <r7(o^ + <r 2 ). 

12-8. Refer to Problem 12-1. 2o 

(a) What is the probability of accepting H 0 if <r T is 4 times the error variance a . 

(b) If the difference between looms is large enough to increase the standard deviation 
of an observation by 20 percent, we wish to detect this with a probability of at least 
0.80. What sample size should be used? 

12-9 An experiment was performed to investigate the capability of a measurement system. 
Ten parts were randomly selected, and two randomly selected operators measured each 
part three times. The tests were made in random order, and the data below resulted. 


Part 

Number 

Operator 1 
Measurements 


Operator 2 
Measurements 


1 

2 

3 

1 

2 

3 

j 

50 

49 

50 

50 

48 

51 

2 

52 

52 

51 

51 

51 

51 

3 

53 

50 

50 

54 

52 

51 

4 

49 

51 

50 

48 

50 

51 

5 

48 

49 

48 

48 

49 

48 

6 

52 

50 

50 

52 

50 

50 

7 

51 

51 

51 

51 

50 

50 

8 

52 

50 

49 

53 

48 

50 

9 

50 

51 

50 

51 

48 

49 

10 

47 

46 

49 

46 

47 

48 


76.208 82.876 

80.346 73.385 

80.802 80.626 

77.364 77.386 
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(a) Analyze the data from this experiment. 

(b) Find point estimates of the variance components using the analysis of variance 
method. 

12-10. Reconsider the data in Problem 5-6. Suppose that both factors, machines and operators, 
are chosen at random. 

(a) Analyze the data from this experiment. 

(b) Find point estimates of the variance components using the analysis of variance 
method. 

12-11. Reconsider the data in Problem 5-13. Suppose that both factors are random. 

(a) Analyze the data from this experiment. 

(b) Estimate the variance components. 

12-12. Suppose that in Problem 5-11 the furnace positions were randomly selected, resulting in 
a mixed model experiment. Reanalyze the data from this experiment under this new 
assumption. Estimate the appropriate model components. 

12-13. Reanalyze the measurement systems experiment in Problem 12-9, assuming that opera¬ 
tors are a fixed factor. Estimate the appropriate model components. 

12-14. In Problem 5-6, suppose that there are only four machines of interest, but the operators 
were selected at random. 

(a) What type of model is appropriate? 

(b) Perform the analysis and estimate the model components. 

12-15. By application of the expectation operator, develop the expected mean squares for the 
two-factor factorial, mixed model. Use the restricted model assumptions. Check your 
results with the expected mean squares given in Table 12-11 to see that they agree. 
12-16. Consider the three-factor factorial design in Example 12-6. Propose appropriate test 
statistics for all main effects and interactions. Repeat for the case where A and B are 
fixed and C is random. 

12-17. Consider the experiment in Example 12-7. Analyze the data for the case where A, B, and 
C are random. 

12-18. Derive the expected mean squares shown in Table 12-14. 

12-19. Consider a four-factor factorial experiment where factor A is at a levels, factor B is at b 
levels, factor C is at c levels, factor D is at d levels, and there are n replicates. Write 
down the sums of squares, the degrees of freedom, and the expected mean squares for 
the following cases. Assume the restricted model for all mixed models. You may use a 
computer package such as Minitab. 

(a) A, B, C, and D are fixed factors. 

(b) A, B, C, and D are random factors. 

(c) A is fixed and B, C, and D are random. 

(d) A and B are fixed and C and D are random. 

(e) A, B, and C are fixed and D is random. 

Do exact tests exist for all effects? If not, propose test statistics for those effects that 
cannot be directly tested. 

12-20. Reconsider cases (c), (d), and (e) of Problem 12-19. Obtain the expected mean squares 
assuming the unrestricted model. You may use a computer package such as Minitab. 
Compare your results with those for the restricted model. 

12-21. In Problem 5-17, assume that the three operators were selected at random. Analyze the 
data under these conditions and draw conclusions. Estimate the variance components. 
12-22. Consider the three-factor factorial model 

n = i,2,...,« 

yijk = p, + T; + j8 i + y k + (Tj8),y + (fiy) jk + e ijk ij=l,2,...,b 

(* = 1, 2,..., c 

Assuming that all the factors are random, develop the analysis of variance table, including 
the expected mean squares. Propose appropriate test statistics for all effects. 
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12-23. The three-factor factorial model for a single replicate is 

y ijk = p, + t ; + ft + y k + (tj 8) 0 + Q3y) y * + (ry)« + (T/3y)„, + e, y * 

If all the factors are random, can any effects be tested? If the three-factor and (r/3), y 
interactions do not exist, can all the remaining effects be tested? 

12-24. In Problem 5-6, assume that both machines and operators were chosen randomly. De¬ 
termine the power of the test for detecting a machine effect such that a-% = a 2 , where 
cr| is the variance component for the machine factor. Are two replicates sufficient? 

12-25. In the two-factor mixed model analysis of variance, show that Cov[(t/3), 7 , (t/ 3), v ] = 
-(l/a)o ^3 for i + i'. 

12-26. Show that the method of analysis of variance always produces unbiased point estimates 
of the variance components in any random or mixed model. 

12-27. Invoking the usual normality assumptions, find an expression for the probability that a 
negative estimate of a variance component will be obtained by the analysis of variance 
method. Using this result, write a statement giving the probability that ai < 0 in a one- 
factor analysis of variance. Comment on the usefulness of this probability statement. 

12-28. Analyze the data in Problem 12-9, assuming that operators are fixed, using both the 
unrestricted and the restricted forms of the mixed models. Compare the results obtained 
from the two models. 

12-29. Consider the two-factor mixed model. Show that the standard error of the fixed factor 
mean (e.g.. A) is [MS AB /bn] 1,2 . 

12-30. Consider the variance components in the random model from Problem 12-9. 

(a) Find an exact 95 percent confidence interval on a 2 . 

(b) Find approximate 95 percent confidence intervals on the other variance components 
using the Satterthwaite method. 

12-31. Use the experiment described in Problem 5-6 and assume that both factors are random. 
Find an exact 95 percent confidence interval on a 2 . Construct approximate 95 percent 
confidence intervals on the other variance components using the Satterthwaite method. 

12-32. Consider the three-factor experiment in Problem 5-17 and assume that operators were 
selected at random. Find an approximate 95 percent confidence interval on the operator 
variance component. 

12-33. Rework Problem 12-30 using the modified large-sample approach described in Section 
12-7.2. Compare the two sets of confidence intervals obtained and discuss. 

12-34. Rework Problem 12-32 using the modified large-sample method described in Section 
12-7.2. Compare this confidence interval with the one obtained previously and discuss. 





Nested and _ 

Split'Plot Designs 


This chapter introduces two important types of experimental designs, the nested design 
and the split-plot design. Both of these designs find reasonably widespread application 
in the industrial use of designed experiments. They also frequently involve one or more 
random factors, and so some of the concepts introduced in Chapter 12 will find appli¬ 
cation here. 


13-1 THE TWO-STAGE NESTED DESIGN 

In certain multifactor experiments the levels of one factor (e.g., factor B) are similar but 
not identical for different levels of another factor (e.g., A). Such an arrangement is called 
a nested, or hierarchical, design, with the levels of factor B nested under the levels of 
factor A. For example, consider a company that purchases its raw material from three 
different suppliers. The company wishes to determine if the purity of the raw material is 
the same from each supplier. There are four batches of raw material available from each 
supplier, and three determinations of purity are to be taken from each batch. The situation 
is depicted in Figure 13-1 on the next page. 

This is a two-stage nested design, with batches nested under suppliers. At first 
glance, you may ask why this is not a factorial experiment. If this were a factorial, then 
batch 1 would always refer to the same batch, batch 2 would always refer to the same 
batch, and so on. This is clearly not the case because the batches from each supplier are 
unique for that particular supplier. That is, batch 1 from supplier 1 has no connection 
with batch 1 from any other supplier, batch 2 from supplier 1 has no connection with 
batch 2 from any other supplier, and so forth. To emphasize the fact that the batches 
from each supplier are different batches, we may renumber the batches as 1, 2, 3, and 4 
from supplier 1; 5, 6, 7, and 8 from supplier 2; and 9, 10, 11, and 12 from supplier 3, as 
shown in Figure 13-2 on the next page. 

Sometimes we may not know whether a factor is crossed in a factorial arrangement 
or nested. If the levels of the factor can be renumbered arbitrarily as in Figure 13-2, then 
the factor is nested. 
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Batches 



C >111 >121 5-131 >14, 

Observations < y u2 y u2 y 132 y u2 

L >113 5 123 5- 133 5-143 

Figure 13-1 A two-stage nested design. 




5-311 5-321 5-331 >-341 

5-312 5-322 5-332 5-342 

5313 5323 5333 5343 


13-1.1 Statistical Analysis 


The linear statistical model for the two-stage nested design is 


ytjk = H + Ti + p m + e (W * 


i =1,2,... 

j = 1.2,... 

k = 1 , 2 ,... 


a 

6 


(13-1) 


That is, there are a levels of factor A, b levels of factor B nested under each level of A, 
and n replicates. The subscript j(i) indicates that the/th level of factor B is nested under 
the ith level of factor A. It is convenient to think of the replicates as being nested within 
the combination of levels of A and B\ thus, the subscript (if)k is used for the error term. 
This is a balanced nested design because there are an equal number of levels of B within 
each level of A and an equal number of replicates. Because every level of factor B does 
not appear with every level of factor A, there can be no interaction between A and B. 

We may write the total corrected sum of squares as 


£ £ £ <y» - y-f = £ £ £ K*. - yj + Oti. - *5 + - Wf am 

Expanding the right-hand side of Equation 13-2 yields 

E E E (y.j* - yJ = bn E ~ y ~ )2 + n E E ^ _ (i3 3) 

+ E E E _ yyf 

because the three cross-product terms are zero. Equation 13-3 indicates that the total sum 
of squares can be partitioned into a sum of squares due to factor A, a sum of squares due 
to factor B under the levels of A, and a sum of squares due to error. Symbolically, we 
may write Equation 13-3 as 

SS T = SS A + SS m) + SS E (13-4) 

There are abn - 1 degrees of freedom for SS T , a - 1 degrees of freedom for SS A , 
a(b - 1) degrees of freedom for SS B(A) , and ab(n - 1) degrees of freedom for error. 
Note that abn - 1 = (a - 1) + a(b - 1) + ab{n - 1). If the errors are NID(0, a 2 ), we 
may divide each sum of squares on the right of Equation 13-4 by its degrees of freedom 


Batches 




Figure 13-2 Alternate layout for the two-stage nested design. 
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Table 13-1 Expected Mean Squares in the Two-Stage Nested Design 



A Fixed 

A Fixed 

A Random 

E(MS) 

B Fixed 

B Random 

B Random 

E(MS a ) 

bn'Zrf 

bn'Zrf 
a 2 + ncTp H-j— 

a 2 + ncrjj + bnoi 

E(MS b<A) ) 

»n/s, 

* + a(b- 1) 

a 2 + najt 

a 2 Anal 

E(MS e ) 

a 2 

a 2 

o 2 


to obtain independently distributed mean squares such that the ratio of any two mean 
squares is distributed as F. 

The appropriate statistics for testing the effects of factors A and B depend on whether 
A and B are fixed or random. If factors A and B are fixed, we assume that 2“ =1 t, = 0 
and SjLi fij X0 = 0 (/ = 1, 2,.... a). That is, the A treatment effects sum to zero, and 
the B treatment effects sum to zero within each level of A. Alternatively, if A and B are 
random, we assume that r, is NID(0, cr 2 T ) and /3, (0 is NID(0, crj|). Mixed models with A 
fixed and B random are also widely encountered. The expected mean squares can be 
determined by a straightforward application of the rules in Chapter 12. For the mixed 
model, these expected mean squares assume the restricted form of the model in Chapter 
12. Table 13-1 gives the expected mean squares for these situations. 

Table 13-1 indicates that if the levels of A and B are fixed, H 0 : r, = 0 is tested by 
MSJMSe and H 0 : /3, (0 = 0 is tested by MS B(A) /MS E . If A is a fixed factor and B is random, 
then H 0 : r, = 0 is tested by MS A /MS B(A) and H 0 : = 0 is tested by MS B{A) /MS E . Finally, 

if both A and B are random factors, we test H 0 :a 2 = 0 by MSJMS BW and H 0 : <r 2 p = 0 
by MS B(A) /MS e . The test procedure is summarized in an analysis of variance table as 
shown in Table 13-2. Computing formulas for the sums of squares may be obtained by 
expanding the quantities in Equation 13-3 and simplifying. They are 



ss A = l 

bn ,=i abn 


(13-5) 


SS BW = ~ n i) 2 yfj. ~ y n S yl. 

(13-6) 


SS E = 2 i i yf jk - 


(13-7) 


ss r = 2 2 2 fak - 

yl. 

abn 

(13-8) 

Table 13-2 

Analysis of Variance Table for the Two-Stag( 

t Nested Design 


Source of 


Degrees of 

Mean 

Variation 

Sum of Squares 

Freedom 

Square 

A 

bn 2 (Pl. ~ y..f 

a - 1 

ms a 

B within A 

«2 2 (y,. - a ..) 2 

a(b - 1) 

ms bw 

Error 

2 2 2 (y* - y.jf 

ab(n - 1) 

ms e 

Total 

2 2 2 (y u * ~ fJ 1 

abn - 1 
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We see that Equation 13-6 for SS B(A) can be written as 

SSbw = % ^ 

This expresses the idea that SS B(A) is the sum of squares between levels of B for each 
level of A, summed over all the levels of A. 


EXAMPLE 13-1 . 

Consider a company that buys raw material in batches from three different suppliers. The 
purity of this raw material varies considerably, which causes problems in manufacturing 
the finished product. We wish to determine if the variability in purity is attributable to 
differences between the suppliers. Four batches of raw material are selected at random 
from each supplier, and three determinations of purity are made on each batch. This is, 
of course, a two-stage nested design. The data, after coding by subtracting 93, are shown 
in Table 13-3. The sums of squares are computed as follows: 


SSt S £ £ y ” k abn 


bn y " abn 

= —— [(—5) 2 + (4) 2 + (14) 2 ] 
(4)(3) LV ' 

= 19.75 - 4.69 = 15.06 

= \ [(0) 2 + (—9) 2 + (-1) 2 + • 
= 89.67 - 19.75 = 69.92 


(13 f 
36 


+ (2) 2 + (6) 2 ] - 19.75 


and 

sSs = iitA-- n iiy}, 

= 153.00 - 89.67 = 63.33 


Table 13-3 Coded Purity Data for Example 13-1 (Code: y ijk = purity - 93) _ 

Supplier 1 Supplier 2 Supplier 3 

_ Batches 1 _ 2 34123412 34 

1 —2 —2 1 1 0 -1 0 2 —2 1 3 

-1 -3 0 4 -2 4 0 3 4 0 -1 2 

0 -4 1 0 -3 2 -2 2 0 2 2 1 

Batch totals y u . 0 -9 -1 5 -4 6 -3 5 6 0 2 6 

Supplier totals y,.. -5 4 14 
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Table 13-4 Analysis of Variance for the Data in Example 13-1 


Source of Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

Expected Mean 
Square 

F„ 

F-Value 

Suppliers 

15.06 

2 

7.53 

a 2 + 3<jp + 6^7; 

0.97 

0.42 

Batches (within suppliers) 

69.92 

9 

7.77 

a 2 + 3tr| 

2.94 

0.02 

Error 

63.33 

24 

2.64 

a 2 



Total 

148.31 

35 






The analysis of variance is summarized in Table 13-4. Suppliers are fixed and batches 
are random, so the expected mean squares are obtained from the middle column of Table 
13-1. They are repeated for convenience in Table 13-4. From examining the P values, 
we would conclude that there is no significant effect on purity due to suppliers, but the 
purity of batches of raw material from the same supplier does differ significantly. 


The practical implications of this experiment and the analysis are very important. 
The objective of the experimenter is to find the source of the variability in raw material 
purity. If it results from differences among suppliers, we may be able to solve the problem 
by selecting the “best” supplier. However, that solution is not applicable here because 
the major source of variability is the batch-to-batch purity variation within suppliers. 
Therefore, we must attack the problem by working with the suppliers to reduce their 
batch-to-batch variability. This may involve modifications to the suppliers’ production 
processes or their internal quality assurance system. 

Notice what would have happened if we had incorrectly analyzed this design as a 
two-factor factorial experiment. If batches are considered to be crossed with suppliers, 
we obtain batch totals of 2, -3, —2, and 16, with each batch X suppliers cell containing 
three replicates. Thus, a sum of squares due to batches and an interaction sum of squares 
can be computed. The complete factorial analysis of variance is shown in Table 13-5, 
assuming the mixed model. 

This analysis indicates that batches differ significantly and that there is a significant 
interaction between batches and suppliers. However, it is difficult to give a practical 
interpretation of the batches X suppliers interaction. For example, does this significant 
interaction mean that the supplier effect is not constant from batch to batch? Furthermore, 
the significant interaction coupled with the nonsignificant supplier effect could lead the 
analyst to conclude that suppliers really differ but their effect is masked by the significant 
interaction. 


Table 13-5 Incorrect Analysis of the Two-Stage Nested Design in Example 13-1 as a Factorial 


(Suppliers Fixed, Batches Random) 

Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

F 0 

F-Value 

Suppliers ( S ) 

15.06 

2 

7.53 

1.02 

0.42 

Batches (B) 

25.64 

3 

8.55 

3.24 

0.04 

S X B interaction 

44.28 

6 

7.38 

2.80 

0.03 

Error 

63.33 

24 

2.64 



Total 

148.31 

35 
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Some statistical software packages will perform the analysis for a nested design. Table 
13-6 presents the output from the Balanced ANOVA procedure in Minitab (using the 
restricted model). The numerical results are in agreement with the manual calculations 
reported in Table 13-4. Minitab also reports the expected mean squares in the lower 
portion of Table 13-6. Remember that the symbol Q[ 1] is a quadratic term that represents 
the fixed effect of suppliers, so in our notation, 

GM = ^17 

Therefore, the fixed effect term in the Minitab expected mean square for suppliers 
\2Q[\\ = 122f=i rf/(3 — 1) = 6 if, which matches the result given by the tabular 

algorithm in Table 13-4. 

Sometimes a specialized computer program for analyzing nested designs is not avail¬ 
able. However, notice from comparing Tables 13-4 and 13-5 that 

SS B + SS SXB = 25.64 + 44.28 = 69.92 - SS B(S) 

That is, the sum of squares for batches within suppliers consist of the sum of squares of 
the batches plus the sum of squares for the batches X suppliers interaction. The degrees 
of freedom have a similar property; that is, 

Batches Batches X Suppliers _ Batches wi thin Suppliers 
3 + 6 9 

Therefore, a computer program for analyzing factorial designs could also be used for the 
analysis of nested designs by pooling the “main effect” of the nested factor and inter¬ 
actions of that factor with the factor under which it is nested. 


Table 1 3-6 Minicab Output (Balanced ANOVA) for Example 13-1 
Analysis of Variance (Balanced Designs) 


Factor 

Supplier 

Batch(Supplier) 


Type Levels Values 
fixed 3 1 

random 4 1 


2 

2 


3 

3 


Analysis of Variance for Purity 


Source 
Supplie r 
Ba t c h(Supp l i 
Error 
Total 


SS 

15.056 

69.917 

63.333 

148.306 


MS F P 
7.528 0.97 0.416 
7.769 2.94 0.017 
2.639 


Source Variance Error 

component term 

1 Supplier 2 

2 Batch(Supplier) 1.710 3 

3 Error 2.639 


Expected Mean Square for Each Ter 
(using restricted model) 

(3) + 3(2) + 12Qn: 

(3) + 3(2) 

(3) 
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13-1.2 Diagnostic Checking 


The major tool used in diagnostic checking is residual analysis. For the two-stage nested 
design, the residuals are 


The fitted value is 


e ijk — yijk y ijk 


Sm = A + % + Axo 

and if we make the usual restrictions on the model parameters (£, t, = 0 and £, /3 y(() = 0, 
i = 1,2 a), then (L = y..., f,- = y,„ — y.„, and = y ljm — y,„. Consequently, the 
fitted value is 

Sijk = y... + ( y,.. ~ y.J + (jij. ~ y J 

= 

Thus, the residuals from the two-stage nested design are 

e ijk = yijk ~ Vy. (13-9) 

where y, y . are the individual batch averages. 

The observations, fitted values, and residuals for the purity data in Example 13-1 
follow: 


Observed Value y ijk Fitted Value y iJk = y, y . e iJk = y lJk — y u . 


0 

-2 

-3 

-4 

-2 

0 


4 

0 

-2 

-3 

0 

4 

2 

-1 

0 

-2 

0 

3 

2 

2 


0.00 

0.00 

0.00 

-3.00 

-3.00 

-3.00 

-0.33 

-0.33 

-0.33 

1.67 

1.67 

1.67 

-1.33 

-1.33 

-1.33 

2.00 

2.00 

2.00 

-1.00 

-1.00 

-1.00 

1.67 

1.67 

1.67 

2.00 


1.00 

-1.00 

0.00 

1.00 

0.00 

-1.00 

-1.67 

0.33 

1.33 
-0.67 

2.33 
-1.67 

2.33 
-0.67 
-1.67 
-2.00 

2.00 

0.00 

0.00 

1.00 

-1.00 

-1.67 

1.33 
0.33 
0.00 



564 CHAPTER 13 NESTED AND SPLIT-PLOT DESIGNS 


Observed Value y tjk 

Fitted Value y ijt = y tj . 

e uk = yuk ~ %. 

4 

2.00 

2.00 

0 

2.00 

-2.00 

-2 

0.00 

-2.00 

0 

0.00 

0.00 

2 

0.00 

2.00 

1 

0.67 

0.33 

-1 

0.67 

-1.67 

2 

0.67 

1.33 

3 

2.00 

1.00 

2 

2.00 

0.00 

1 

2.00 

-1.00 


The usual diagnostic checks—including normal probability plots, checking for outliers, 
and plotting the residuals versus fitted values—may now be performed. As an illustration, 
the residuals are plotted versus the fitted values and against the levels of the supplier 
factor in Figure 13-3. 



(a) Plot of residuals versus the predicted values 



Supplier 

(6) Plot of residuals versus supplier 
Figure 13-3 Residual plots for Example 13-1. 
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In a problem situation such as that described in Example 13-1, the residual plots are 
particularly useful because of the additional diagnostic information they contain. For 
instance, the analysis of variance has indicated that the mean purity of all three suppliers 
does not differ but that there is statistically significant batch-to-batch variability (that is, 
<jp > 0). But is the variability within batches the same for all suppliers? In effect we 
have assumed this to be the case, and if it’s not true we would certainly like to know it, 
because it has considerable practical impact on our interpretation of the results of the 
experiment. The plot of residuals versus suppliers in Figure 13-3& is a simple but effective 
way to check this assumption. Because the spread of the residuals is about the same for 
all three suppliers, we would conclude that the batch-to-batch variability in purity is 
about the same for all three suppliers. 


13-1.3 Variance Components 

For the random effects case, the analysis of variance method can be used to estimate the 
variance components cr 2 ,cr|, and cr 2 r . From the expected mean squares in the last column 
of Table 13-1, we obtain 

& 2 = MS e 

MS B (a ) — MS e 

Vp = --- 


(13-10) 

(13-11) 


£.2 _ MS a MS B (a) 
bn 


(13-12) 


Many applications of nested designs involve a mixed model, with the main factor 
(A) fixed and the nested factor (B) random. This is the case for the problem described in 
Example 13-1; suppliers (factor .A) are fixed, and batches of raw material (factor B) are 
random. The effects of the suppliers may be estimated by 


ti = yi_ - y... 
1 2 = yz. ~ y... 
r 3 = y 3 .. ~ y... 


-5 13 -28 

12 36 36 

_4 _ 13 = -1 
12 36 “ 36 

14 _ 13 = 29 
12 36 36 


To estimate the variance components cr 2 and <Tp, we eliminate the line in the analysis of 
variance table pertaining to suppliers and apply the analysis of variance estimation 
method to the next two lines. This yields 

& 2 = MS e = 2.64 


and 


& l = 


MS b (4) MS e 

n 


7.77 - 2.64 


1.71 


3 
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Stage 1 


Stage 2 


Stage 3 


Lot 1 ... 



Figure 13-4 A three-stage staggered nested design. 


These results are also shown in the lower portion of the Minitab output in Table 13-6. 
From the analysis in Example 13-1, we know that the r, do not differ significantly from 
zero, whereas the variance component cr| is greater than zero. 


13-1.4 Staggered Nested Designs 

A potential problem in the application of nested designs is that sometimes to get a 
reasonable number of degrees of freedom at the highest level, we can end up with many 
degrees of freedom (perhaps too many) at lower stages. To illustrate, suppose that we 
are investigating potential differences in chemical analysis among different lots of ma¬ 
terial. We plan to take five samples per lot, and each sample will be measured twice. If 
we want to estimate a variance component for lots, then 10 lots would not be an unrea¬ 
sonable choice. This results in 9 degrees of freedom for lots, 40 degrees of freedom for 
samples, and 50 degrees of freedom for measurements. 

One way to avoid this is to use a particular type of unbalanced nested design called 
a staggered nested design. An example of a staggered nested design is shown in Figure 
13-4. Notice that only two samples are taken from each lot; one of the samples is mea¬ 
sured twice, whereas the other sample is measured once. If there are a lots, then there 
will be a - 1 degrees of freedom for lots (or, in general, the upper stage), and all lower 
stages will have exactly a degrees of freedom. For more information on the use and 
analysis of these designs, see Bainbridge (1965), Smith and Beverly (1981), Nelson 
(1983, 1995a, 1995b), and the supplemental text material for this chapter. 


13-2 THE GENERAL m-STAGE NESTED DESIGN 

The results of Section 13-1 can be easily extended to the case of m completely nested 
factors. Such a design would be called an m-stage nested design. As an example, suppose 
a foundry wishes to investigate the hardness of two different formulations of a metal 
alloy. Three heats of each alloy formulation are prepared, two ingots are selected at 
random from each heat for testing, and two hardness measurements are made on each 
ingot. The situation is illustrated in Figure 13-5 on the facing page. 

In this experiment, heats are nested under the levels of the factor alloy formulation, 
and ingots are nested under the levels of the factor heats. Thus, this is a three-stage nested 
design with two replicates. 
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The model for the general three-stage nested design is 

{ i = 1,2 

i = \,2 ’ b c (13 ' 13) 

1= 1 , 2 ,..., n 

For our example, r, is the effect of the ith alloy formulation, j8 y0) is the effect of the jth 
heat within the ith alloy, y k(iJ) is the effect of the £th ingot within the jth heat and ith 



Figure 13-6 Sources of variation in the three-stage nested design example. 
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Table 13-7 Analysis of Variance for the Three-Stage Nested Design 


Source of 
Variation 

Sum of Squares 

Degrees of 
Freedom 

Mean 

Square 

A 

ben 2 (ji~ ~~ y~~) 2 

a - 1 

MS a 

B (within A) 

(*/.. - 3o 2 

a(b - 1) 

ms B(A) 

C (within B) 

«2 2 2 Gut. - y~f 

ab(c - 1) 

MS cm 

Error 

2 2 2 2 C y V ki - yukf 

abc(n — 1) 

ms e 

Total 

2 2 2 2 Cy</« - y.J 

aben - 1 



alloy, and e Ujk)l is the usual NID(0, a 2 ) error term. Extension of this model to m factors 
is straightforward. 

Notice that in the above example the overall variability in hardness consists of three 
components: one that results from alloy formulations, one that results from heats, and 
one that results from analytical test error. These components of the variability m overall 
hardness are illustrated in Figure 13-6 on the previous page. 

This example demonstrates how the nested design is often used in analyzing pro¬ 
cesses to identify the major sources of variability in the output. For instance, if the alloy 
formulation variance component is large, then this implies that overall hardness van- 
ability could be reduced by using only one alloy formulation. 

The calculation of the sums of squares and the analysis of variance for the m-stage 
nested design are similar to the analysis presented in Section 13-1. For example, the 
analysis of variance for the three-stage nested design is summarized in Table 13-7. Def¬ 
initions of the sums of squares are also shown in this table. Notice that they are a simple 
extension of the formulas for the two-stage nested design. Many statistics software pack¬ 
ages will perform the calculations. 

To determine the proper test statistics we must find the expected mean squares using 
the methods of Chapter 12. For example, if factors A and B are fixed and factor C is 
random, then we may derive the expected mean squares as shown in Table 13-8. This 
table indicates the proper test statistics for this situation. 


Table 13-8 

Expected Mean Square Derivation for a 
with A and B Fixed and C Random 

i Three-Stage Nested Design 



F 

F R 

b c 

j k 

R 

l 

Expected Mean Square 

T 

0 

b c 

n 

a 2 + ncrl 

bCfl 2 j 

' + -i - 

ftw 

1 

0 c 

n 

a 2 + no 2 

cn 2 2 Ph 

' + a(b - 1) 

r«y) 

1 

1 1 

1 1 

1 

a 2 + na 2 
a 2 
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13-3 DESIGNS WITH BOTH NESTED AND FACTORIAL FACTORS 


Occasionally in a multifactor experiment, some factors are arranged in a factorial layout 
and other factors are nested. We sometimes call these designs nested-factorial designs. 
The statistical analysis of one such design with three factors is illustrated in the following 
example. 


EXAMPLE 13-2 


An industrial engineer is studying the hand insertion of electronic components on printed 
circuit boards to improve the speed of the assembly operation. He has designed three 
assembly fixtures and two workplace layouts that seem promising. Operators are required 
to perform the assembly, and it is decided to randomly select four operators for each 
fixture-layout combination. However, because the workplaces are in different locations 
within the plant, it is difficult to use the same four operators for each layout. Therefore, 
the four operators chosen for layout 1 are different individuals from the four operators 
chosen for layout 2. Because there are only three fixtures and two layouts, but the op¬ 
erators are chosen at random, this is a mixed model. The treatment combinations in this 
design are run in random order and two replicates are obtained. The assembly times are 
measured in seconds and are shown in Table 13-9. 

In this experiment, operators are nested within the levels of layouts, whereas fixtures 
and layouts are arranged in a factorial. Thus, this design has both nested and factorial 
factors. The linear model for this design is 


yw = m t + ft + 7*0) + (r P)u + (tjOuu) + £(ijk)i 


'i = 1, 2, 3 
j = 1, 2 

k = 1, 2, 3, 4 
J- 1 , 2 


(13-14) 


where r, is the effect of the ith fixture, j8, is the effect of the y'th layout, y k(J) is the effect 
of the Mi operator within the jth level of layout, (t/ 3), 7 is the fixture X layout interaction, 
(77);*(,) is the fixture X operators within layout interaction, and e m) , is the usual error 
term. Notice that no layout X operator interaction can exist because all the operators do 
not use all the layouts. Similarly, there can be no three-way fixture X layout X operator 
interaction. The expected mean squares are derived in Table 13-10 (page 570) using the 
tabular algorithm of Chapter 12. This produces a restricted mixed model analysis. The 
proper test statistic for any effect or interaction can be found from inspection of this 
table. 


Table 13-9 Assembly Time Data for Example 13-2 

Operator 


Layout 1 



Layout 2 


>/... 

1 

2 

3 

4 

1 

2 

3 

4 

Fixture 1 

22 

23 

28 

25 

26 

27 

28 

24 

404 


24 

24 

29 

23 

28 

25 

25 

23 


Fixture 2 

30 

29 

30 

27 

29 

30 

24 

28 

447 


27 

28 

32 

25 

28 

27 

23 

30 


Fixture 3 

25 

24 

27 

26 

27 

26 

24 

28 

401 


21 

22 

25 

23 

25 

24 

27 

_27 


Operator totals, y, jk . 

149 

150 

171 

149 

163 

159 

151 

160 


Layout totals, y,j„ 


619 



633 


1252 = y.„. 
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Table 13-10 Expected Mean Square Derivation for Example 13-2 



F 

3 

F 

2 

j 

R 

4 

k 

R 

2 

l 

Expected Mean Square 

T . 

0 

2 

4 

2 

a 2 + 2a 2 ry + 8 2 T ? 

Pj 

3 

0 

4 

2 

a- 2 + 6a 2 y + 24 2 $ 

7m 

3 

1 

1 

2 

a 2 + 6 a 2 

m v 

0 

0 

4 

2 

O- 2 + 2(7^ + 4 2 X ml 

(Ty)i«j) 

0 

1 

1 

2 

a 2 + 2 o 2 . 


1 

1 

1 

1 

0-2 


The complete analysis of variance is shown in Table 13-11. We see that assembly 
fixtures are significant and that operators within layouts also differ significantly. There 
is also a significant interaction between fixtures and operators within layouts, indicating 
that the effects of the different fixtures are not the same for all operators. The workplace 
layouts seem to have little effect on the assembly time. Therefore, to minimize assembly 
time, we should concentrate on fixture types 1 and 3. (Note that the fixture totals in Table 
13-9 are smaller for fixture types 1 and 3 than for type 2. This difference in fixture type 
means could be formally tested using multiple comparisons.) Furthermore, the interaction 
between operators and fixtures implies that some operators are more effective than others 
using the same fixtures. Perhaps these operator-fixture effects could be isolated and the 
less-effective operators’ performance improved by retraining them. 


Computing 

There are a number of statistical software packages that can easily analyze nested-fac¬ 
torial designs, including both Minitab and SAS. Table 13-12 (page 571) presents the 
output from Minitab (Balanced ANOVA), assuming the restricted form of the mixed 
model, for Example 13-2. The expected mean squares in the bottom portion of Table 13- 
12 agree with those derived by the tabular method in Table 13-10. Q[ 1], Q[ 3], and Q[ 4] 
are the fixed-factor effects for layouts, fixtures, and layouts X fixtures, respectively. The 
estimates of the variance components are: 

Operator (layout): cr 2 — 1.609 

Fixture x Operator (layout): a%, = 1.576 

Error: tr 2 = 2.333 


Table 13-11 Analysis of Variance for Example 13-2 


Sum of Degrees of 

Source of Variation Squares Freedom 

Fixtures (F) 82.80 2 

Layouts (L) 4.08 1 

Operators (within layouts), 0{L) 71.91 6 

FL 19.04 2 

FO{L) 65.84 12 

Error 56.00 24 

Total 299.67 47 


Mean 

Square F 0 P-Value 


41.40 

4.09 

11.99 

9.52 

5.49 

2.33 


7.54 

0.34 

5.15 

1.73 

2.36 


0.01 

0.58 

<0.01 

0.22 

0.04 




Analysis of Variance (Balanced Designs) 



571 







Table 13-13 Minitab Balanced ANOVA Analysis of Example 13-2 Using the Unrestricted Model 
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Layout* Fixture 
Fixture*Operator(Layout> 
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Table 13-13 (page 572) presents the Minitab analysis of Example 13-2 using the 
unrestricted form of the mixed model. The expected mean squares in the lower portion 
of this table are slightly different from those reported for the restricted model, and so the 
construction of the test statistic will be slightly different for the operators (layout) factor. 
Specifically, the F ratio denominator for operators (layout) is the fixtures X operators 
(layout) interaction in the restricted model (12 degrees of freedom for error), and it is 
the layout X fixtures interaction in the unrestricted model (2 degrees of freedom for 
error). Because MS layout x flxture > MSrmurexoperatoniayout). and it has fewer degrees of free¬ 
dom, we now find that the operator within layout effect is only significant at about the 
12 percent level (the P value was 0.002 in the restricted model analysis). Furthermore, 
the variance component estimate & 2 = 1.083 is smaller. However, because there is a 
large fixture effect and a significant fixture X operator (layout) interaction, we would 
still suspect that there is an operator effect, and so the practical conclusions from this 
experiment are not greatly affected by choosing either the restricted or the unrestricted 
form of the mixed model. The quantities Q[ 1, 4] and Q[3, 4] are fixed-type quadratic 
terms containing the interaction effect of layouts X fixtures. 

If there is no specialized software such as SAS or Minitab available, then a program 
for analyzing factorial experiments can be used to analyze experiments with nested and 
factorial factors. For instance, the experiment in Example 13-2 could be considered as a 
three-factor factorial, with fixtures (F), operators (0), and layouts (L) as the factors. Then 
certain sums of squares and degrees of freedom from the factorial analysis would be 
pooled to form the appropriate quantities required for the design with nested and factorial 
factors as follows: 


Factorial Analysis 


Nested-Factorial Analysis 


Sum of Squares 

Degrees of 
Freedom 

Sum of Squares 

Degrees of 
Freedom 

SS F 

2 

SS F 

2 

SS L 

1 

SS L 

1 

SS FL 

2 

SS FL 

2 

SS 0 

3 

SS 0(L) = SS 0 + SSlo 

6 

SS LO 

3 



SS FO 

6 



SS FOL 

6 

SS FO(L) = SS FO + SS FOL 

12 

ss E 

24 

SS E 

24 

SS T 

47 

SS T 

47 


13-4 THE SPLIT-PLOT DESIGN 

In some multifactor factorial experiments we may be unable to completely randomize 
the order of the runs. This often results in a generalization of the factorial design called 

a split-plot design. 

As an example, consider a paper manufacturer who is interested in three different 
pulp preparation methods and four different cooking temperatures for the pulp and who 
wishes to study the effect of these two factors on the tensile strength of the paper. Each 
replicate of a factorial experiment requires 12 observations, and the experimenter has 
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decided to run three replicates. However, the pilot plant is only capable of making 12 
runs per day, so the experimenter decides to run one replicate on each of the three days 
and to consider the days or replicates as blocks. On any day, he conducts the experiment 
as follows. A batch of pulp is produced by one of the three methods under study. Then 
this batch is divided into four samples, and each sample is cooked at one of the four 
temperatures. Then a second batch of pulp is made up using another of the three methods. 
This second batch is also divided into four samples that are tested at the four temperatures. 
The process is then repeated, using a batch of pulp produced by the third method. The 
data are shown in Table 13-14. 

Initially, we might consider this to be a factorial experiment with three levels of 
preparation method (factor A) and four levels of temperature (factor B) in a randomized 
block. If this is the case, then the order of experimentation within each replicate or block 
should be completely randomized. That is, within a block, we should randomly select a 
treatment combination (a preparation method and a temperature) and obtain an obser¬ 
vation, then we should randomly select another treatment combination and obtain a sec¬ 
ond observation, and so on, until the 12 observations in the block have been taken. 
However, the experimenter did not collect the data this way. He made up a batch of pulp 
and obtained observations for all four temperatures from that batch. Because of the 
economics of preparing the batches and the size of the batches, this is the only feasible 
way to run this experiment. A completely randomized factorial experiment would require 
36 batches of pulp, which is completely unrealistic. The split-plot design requires only 
three batches of pulp per block (replicate), in this case 9 batches total. Obviously, the 
split-plot design has resulted in considerable experimental efficiency. 

The design used in our example is a split-plot design. Each replicate or block in the 
split-plot design is divided into three parts called whole plots, and the preparation meth¬ 
ods are called the whole plot or main treatments. Each whole plot is divided into four 
parts called subplots (or split-plots), and one temperature is assigned to each. Temper¬ 
ature is called the subplot treatment. Note that if there are other uncontrolled or un¬ 
designed factors present, and if these uncontrolled factors vary as the pulp preparation 
methods are changed, then any effect of the undesigned factors on the response will be 
completely confounded with the effect of the pulp preparation methods. Because the 
whole plot treatments in a split-plot design are confounded with the whole plots and the 
subplot treatments are not confounded, it is best to assign the factor we are most interested 
in to the subplots, if possible. 

This example is fairly typical of how the split-plot design is used in an industrial 
setting. Notice that the two factors were essentially “applied” at different times. Con¬ 
sequently, a split-plot design can be viewed as two experiments “combined” or super- 


Table 13-14 The Experiment oi 

i the Tensile Strength of Paper 







Replicate (or 
Block) 1 



Replicate (or 
Block) 2 



Replicate (or 
Block) 3 


Pulp Preparation Method 

1 

2 

3 

1 

2 

3 

_ 1 _ 

2 

3 

Temperature (°F) 

200 

30 

34 

29 

28 

31 

31 

31 

35 

32 

225 

35 

41 

26 

32 

36 

30 

37 

40 

34 

250 

37 

38 

33 

40 

42 

32 

41 

39 

39 

275 

36 

42 

36 

41 

40 

40 

40 

44 

45 
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imposed on each other. One “experiment” has the whole plot factor applied to the large 
experimental units (or it is a factor whose levels are hard to change) and the other 
“experiment” has the subplot factor applied to the smaller experimental units (or it is a 
factor whose levels are easy to change). 

The linear model for the split-plot design is 

y ijk = fi+ Ti + 13j + (t)8 )ij + y k + (ry),* 

['=>• 2.r 

+ (P y)j k + (t/S i) ijk + e ijk ij= 1,2 ,...,a (13-15) 

[k = 1, 2,..., b 

where r,-, (3 n and (t/ 3) (/ represent the whole plot and correspond respectively to blocks 
(or replicates), main treatments (factor A), and whole plot error (replicates (or 
blocks) X A); and y k , (ry) ik , (j 3y) jk , and (r^y),^ represent the subplot and correspond 
respectively to the subplot treatment (factor B), the replicates (or blocks) X B and AB 
interactions, and the subplot error (blocks X AB). Note that the whole plot error is the 
replicates (or blocks) X A interaction and the subplot error is the three-factor interaction 
blocks X AB. The sums of squares for these factors are computed as in the three-way 
analysis of variance without replication. 

The expected mean squares for the split-plot design, with replicates or blocks random 
and main treatments and subplot treatments fixed, are derived in Table 13-15. Note that 
the main factor (A) in the whole plot is tested against the whole plot error, whereas the 
subtreatment ( B ) is tested against the replicates (or blocks) X subtreatment interaction. 
The AB interaction is tested against the subplot error. Notice that there are no tests for 
the replicate (or block) effect (A) or the replicate (or block) X subtreatment (AC) 
interaction. 

The analysis of variance for the tensile strength data in Table 13-14 is summarized 
in Table 13-16 on the next page. Because both preparation methods and temperatures 
are fixed and replicates are random, the expected mean squares in Table 13-15 apply. 
The mean square for preparation methods is compared to the whole plot error mean 


Table 13-15 Expected Mean Square Derivation for Split-Plot Design 



Factor 

R 

F 

j 

b 

F 

k 

R 

h 

Expected Mean Square 


T . 

1 

a 

b 

1 

tr 2 + abrri 

Whole plot 

Pi 

r 

0 

b 

1 

rbl.fi 

a 2 + bfia + - -— 


(T/% 

1 

0 

b 

1 

cr 2 + balp 


yt 

r 

« 

0 

1 

2 2+ r*lyl 

° + a<J " + (b ~ 1) 


( ry) ik 

1 

a 

0 

1 

a 2 + aa 2 ^ 

Subplot 

mb 

(mint 

1 

0 

0 

0 

0 

1 

'■XX mb 

O' + (Trp-y + (fl _ 1)(fe _ 1} 

a 2 + crltiy 



1 

1 

1 

1 

a 3 (not estimable) 
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Table 13-16 Analysis of Variance for the Split-Plot Design Using Tensile Strength Data 
from Table 13-14 __ 


Source of Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

F 0 

P-Value 

Replicates (or Blocks) 

77.55 

2 

38.78 



Preparation method (A) 

128.39 

2 

64.20 

7.08 

0.05 

Whole Plot Error (replicates (or Blocks) X A) 

36.28 

4 

9.07 



Temperature (B) 

434.08 

3 

144.69 

41.94 

<0.01 

Replicates (or Blocks) X B 

20.67 

6 

3.45 



AB 

75.17 

6 

12.53 

2.96 

0.05 

Subplot Error (replicates (or Blocks) X AB) 

50.83 

12 

4.24 



Total 

822.97 

35 





square, and the mean square for temperatures is compared to the replicate (or block) X 
temperature (AC) mean square. Finally, the preparation method X temperature mean 
square is tested against the subplot error. Both preparation methods and temperature have 
a significant effect on strength. 

Note from Table 13-16 that the subplot error (4.24) is less than the whole plot error 
(9.07). This is the usual case in split-plot designs because the subplots are generally more 
homogeneous than the whole plots. This results in two different error structures for the 
experiment. Because the subplot treatments are compared with greater precision, it is 
preferable to assign the treatment we are most interested in to the subplots, if possible. 

Some authors propose a slightly different statistical model for the split-plot design, 
say 


y,jk = ii + Ti + Pj + (tP),j + y k + (Py)jk + 


i = 1, 2. r 

j = 1, 2 
k= 1,2,..., b 


(13-16) 


In this model (r]8 !),-,• is still the whole plot error, but the blocks X B and blocks X AB 
interactions have essentially been pooled with e ijk to form the subplot error. If we denote 
the variance of the subplot error term e ijk by a\ and make the same assumptions as for 
model (Equation 13-15), the expected mean squares become 


Factor 

E(MS ) 

r, (Replicates or Blocks) 

crl + aba 2 


rb 2) Pj 

Pi (A) 

ai + baip + 

a — 1 

(rp). 

cr^ + balp (whole plot error) 


y k (B) 


(py) jk (AB) 

°' £ + (a ~ 1 )(b - 1) 

*ijk 

t t\ (subplot error) 
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Notice that now both the subplot treatment (B) and the AB interaction are tested against 
the subplot error mean square. If one is reasonably comfortable with the assumption that 
the replicates (or blocks) X B and replicates (or blocks) X AB interactions are negligible, 
then this alternative model is entirely satisfactory. 

The split-plot design has an agricultural heritage, with the whole plots usually being 
large areas of land and the subplots being smaller areas of land within the large areas. 
For example, several varieties of a crop could be planted in different fields (whole plots), 
one variety to a field. Then each field could be divided into, say, four subplots, and each 
subplot could be treated with a different type of fertilizer. Here the crop varieties are the 
main treatments and the different fertilizers are the subtreatments. 

Despite its agricultural basis, the split-plot design is useful in many scientific and 
industrial experiments. In these experimental settings, it is not unusual to find that some 
factors require large experimental units whereas other factors require small experimental 
units, such as in the tensile strength problem described above. Alternatively, we some¬ 
times find that complete randomization is not feasible because it is more difficult to 
change the levels of some factors than others. The hard-to-vary factors form the whole 
plots whereas the easy-to-vary factors are run in the subplots. 

In principle, we must carefully consider how the experiment must be conducted 
and incorporate all restrictions on randomization into the analysis. We illustrate this 
point using a modification of the eye focus time experiment in Chapter 6. Suppose there 
are only two factors, visual acuity (A) and illumination level (B). A factorial experiment 
with a levels of acuity, b levels of illumination, and n replicates would require that 
all abn observations be taken in random order. However, in the test apparatus, it is 
fairly difficult to adjust these two factors to different levels, so the experimenter de¬ 
cides to obtain the n replicates by adjusting the device to one of the a acuities and one 
of the b illumination levels and running all n observations at once. In the factorial design, 
the error actually represents the scatter or noise in the system plus the ability of the 
subject to reproduce the same focus time. The model for the factorial design could be 
written as 


yijk = At + Tt + fa + (r/3) y + <f> ijk + e ijk 


i= 1, 2,..., a 
j = 1, 2,..., b (13-17) 
k= 1,2,...,« 


where 4> IJk represents the scatter or noise in the system that results from “experimental 
error’ ’ (that is, our failure to duplicate exactly the same levels of acuity and illumination 
on different runs, variability in environmental conditions, and the like), and 9 ijk represents 
the “reproducibility error” of the subject. Usually, we combine these components into 
one overall error term, say e iJk = <f> ijk + 0 ijk . Assume that V{e ljk ) = a 2 = a\ + a 2 e . Now, 
in the factorial design, the error mean square has expectation cr 2 = cr^ + crj, with 
ab(n — 1) degrees of freedom. 

If we restrict the randomization as in the second design above, then the “error” 
mean square in the analysis of variance provides an estimate of the “reproducibility 
error” a 2 e with ab(n - 1) degrees of freedom, but it yields no information on the “ex¬ 
perimental error” a\. Thus, the mean square for error in this second design is too small; 
consequently, we will wrongly reject the null hypothesis very frequently. As pointed out 
by John (1971), this design is similar to a split-plot design with ab whole plots, each 
divided into n subplots, and no subtreatment. The situation is also similar to subsam- 
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pling, as described by Ostle (1963). Assuming that A and B are fixed, the expected mean 
squares in this case are 

bn E 7? 

E(MS a ) = ai + nal + _ 

«« E Pj 

E(MS b ) = o-^ + ncr^, + ^ _ 1 

n E E ( T P% 

E(MS ab ) = <r| + ncr^, + --—-— 

(a - l)(b - 1) 

£(MS £ ) = o-| (13-18) 

Thus, there are no tests on the main effects unless interaction is negligible. The situation 
is exactly that of a two-way analysis of variance with one observation per cell. If both 
factors are random, then the main effects may be tested against the AB interaction. If 
only one factor is random, then the fixed factor can be tested against the AB interaction. 

In general, if one analyzes a factorial design and all the main effects and interactions 
are significant, then one should examine carefully how the experiment was actually 
conducted. There may be randomization restrictions in the model not accounted for in 
the analysis, and consequently, the data should not be analyzed as a factorial. 

13-5 OTHER VARIATIONS OF THE SPLIT-PLOT DESIGN 

13-5.1 Split-Plot Designs with More Than Two Factors 

Sometimes we find that either the whole plot or the subplot will contain two or more 
factors, arranged in a factorial structure. As an example, consider an experiment con¬ 
ducted on a furnace to grow an oxide on a silicon wafer. The response variables of interest 
are oxide layer thickness and layer uniformity. There are four design factors: temperature 
(A), gas flow (B), time (C), and wafer position in the furnace (D). The experimenter plans 
to run a 2 4 factorial design with two replicates (32 trials). Now factors A and B (tem¬ 
perature and gas flow) are difficult to change, whereas C and D (time and wafer position) 
are easy to change. This leads to the split-plot design shown in Figure 13-7 on the facing 
page. Notice that both replicates of the experiment are split into four whole plots, each 
containing one combination of the settings of temperature and gas flow. Once these levels 
are chosen, each whole plot is split into four subplots and a 2 2 factorial in the factors 
time and wafer position is conducted, where the treatment combinations in the subplot 
are tested in random order. Only four changes in temperature and gas flow are made in 
each replicate, whereas the levels of time and wafer position are completely randomized. 

A model for this experiment, consistent with Equation 13-16, is 
ytjkim = H + Ti + Pj + y k + (Py)jk + 8 iJk + 8, + A m + (8A), m 

+ 038),, + (j3A), m + (y8) kl + (8A) /m + (pyS) Jkl + (pyX) jkm 

| i= 1, 2 

j= 1, 2 

k = 1, 2 (13-19) 

l = 1, 2 
m = 1 , 2 
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_ Block 1 _ 

:0 0 0 0 


_ Block 2 _ 

0 0 0 0 


Subplot + 
C 


Figure 13-7 A split-plot design with four design factors, two in the whole plot and two in the subplot. 



where r, represents the replicate effect, f3 , and y k represent the whole plot main effects, 
Q ijk is the whole plot error, 5/ and \ m represent the subplot main effects, and e ijk!m is the 
subplot error. We have included all interactions between the four design factors. Table 
13-17 presents the analysis of variance for this design, assuming that replicates are ran¬ 
dom and all design factors are fixed effects. In this table cr| and al represent the vari¬ 
ances of the whole plot and subplot errors, respectively, a 2 r is the variance of the block 


Table 13-17 An Abbreviated Analysis for a Split-Plot Design with Factors A and B 

in the Whole Plots and Factors C and D in the Subplots (Refer to Figure 13-7) 


Source of 

Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Expected Mean Square 

Replicates ( 77 ) 

SS Rep ii CaKS 

1 

al + 16cr? 

A (ft) 

SS A 

1 

cr\ + Sal + A 

B( y k ) 

SS B 

1 

a\ + Sa 2 e + B 

AB 

SS AB 

1 

al + Sal + AS 

Whole Plot Error ( 6 ijk ) 

ss WP 

3 

al + Sal 

C (S t ) 

SSc 

1 

al + C 

D( AJ 

SS D 

1 

al + D 

CD 

SS CD 

1 

al + CD 

AC 

SS AC 

1 

al + AC 

BC 

ss BC 

1 

al + BC 

AD 

SS AD 

1 

al + AD 

BD 

SS BD 

1 

al + BD 

ABC 

SS ABC 

1 

cj\ 4- ABC 

ABD 

ss ABD 

1 

ai + ABD 

ACD 

ss ACD 

1 

al + ACD 

BCD 

SS BCD 

1 

al + BCD 

ABCD 

SS ABCD 

1 

al + ABCD 

Subplot Error (e yHm ) 

SS S p 

12 


Total 

SS T 

31 
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effects, and (for simplicity) we have used capital Latin letters to denote fixed-type effects. 
The whole plot main effects and interaction are tested against the whole plot error, 
whereas the subplot factors and all other interactions are tested against the subplot error. 
If some of the design factors are random, the test statistics will be different. In some 
cases there will be no exact F tests and Satterthwaite’s procedure (described in Chapter 
12) should be used. 

Factorial experiments with three or more factors in a split-plot structure tend to be 
rather large experiments. On the other hand, the split-plot structure often makes it easier 
to conduct a larger experiment. For instance, in the oxide furnace example, the experi¬ 
menters only have to change the hard-to-change factors (.A and B) eight times, so perhaps 
a 32-run experiment is not too unreasonable. It is possible to reduce the number of runs 
by using a fractional factorial for the design factors of interest. 


13-5.2 The Split-Split-Plot Design 

The concept of split-plot designs can be extended to situations in which randomization 
restrictions may occur at any number of levels within the experiment. If there are two 
levels of randomization restrictions, the layout is called a split-split-plot design. The 
following example illustrates such a design. 


EXAMPLE 13-3 . 

A researcher is studying the absorption times of a particular type of antibiotic capsule. 
There are three technicians, three dosage strengths, and four capsule wall thicknesses of 
interest to the researcher. Each replicate of a factorial experiment would require 36 
observations. The experimenter has decided on four replicates, and it is necessary to mn 
each replicate on a different day. Note that the days can be considered as blocks. Within 
a replicate (or a block) (day), the experiment is performed by assigning a unit of antibiotic 
to a technician who conducts the experiment on the three dosage strengths and the four 
wall thicknesses. Once a particular dosage strength is formulated, all four wall thicknesses 
are tested at that strength. Then another dosage strength is selected and all four wall 
thicknesses are tested. Finally, the third dosage strength and the four wall thicknesses 
are tested. Meanwhile, two other laboratory technicians also follow this plan, each start¬ 
ing with a unit of antibiotic. 

Note that there are two randomization restrictions within each replicate (or block): 
technician and dosage strength. The whole plots correspond to the technician. The order 
in which the technicians are assigned the units of antibiotic is randomly determined. The 
dosage strengths form three subplots. Dosage strength may be randomly assigned to a 
subplot. Finally, within a particular dosage strength, the four capsule wall thicknesses 
are tested in random order, forming four sub-subplots. The wall thicknesses are usually 
called sub-subtreatments. Because there are two randomization restrictions in the exper¬ 
iment (some authors say two “splits” in the design), the design is called a split-split- 
plot design. Figure 13-8 (page 581) illustrates the randomization restrictions and 
experimental layout in this design. 
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Dosage 

strength 


Wall 

thicknesses 


Technician 

1 2 3 


1 2 3 1 2 3 1 2 3 


1 1 1 
Wall 2 2 2 

thicknesses 3 3 3 


Wall 

thicknesses 



Wall 2 2 2 

thicknesses 3 3 3 


Figure 13-8 A split-split-plot design. 


A linear statistical model for the split-split-plot design is 


ytjkh = m + Ti + Pj + (Tj8)y + y k + (jy)ik + (Py) Jk + (rPy) iJk 

+ S h + (tS),/, + (p8) Jh 

+ (t/3S), ;7i + (y S) kh + (ry8) ikh + (Py8) jkh 


+ (TPyS) ijkh + e m 


i = 1,2 
j= 1,2,..., a 
k = 1, 2 
h = 1,2,..., c 


(13-20) 


where r,, j8 y , and (tj8), 7 represent the whole plot and correspond to replicates or blocks, 
main treatments (factor A), and whole plot error (replicates (or blocks) X A), respectively; 
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and y k , ( ry) ik , (py) Jk and (yfiy) ijk represent the subplot and correspond to the subplot 
treatment (factor B), the replicates (or blocks) X B and AB interactions, and the sub¬ 
plot error, respectively; and 8 h and the remaining parameters correspond to the sub¬ 
subplot and represent respectively the sub-subplot treatment (factor C) and the remaining 
interactions. The four-factor interaction (tj 3y8) ijkh is called the sub-subplot error. 

Ass umin g that the replicates (blocks) are random and that the other design factors 
are fixed, we may derive the expected mean squares as shown in Table 13-18. Tests on 
the main treatments, subtreatments, sub-subtreatments and their interactions are obvious 
from inspection of this table. Note that no tests on replicates or blocks or interactions 
involving replicates or blocks exist. 

The statistical analysis of a split-split-plot design is like that of a single replicate of 
a four-factor factorial. The number of degrees of freedom for each test are determined 
in the usual manner. To illustrate, in Example 13-3, where we had four replicates, three 
technicians, three dosage strengths, and four wall thicknesses, we would have only (r - 
l)(a - 1) = (4 - 1)(3 - 1) = 6 whole plot error degrees of freedom for testing tech¬ 
nicians. This is a relatively small number of degrees of freedom, and the experimenter 
might consider using additional replicates to increase the precision of the test. If there 
are a replicates, we will have 2(r — 1) degrees of freedom for whole plot error. Thus, 


Table 13-18 Expected Mean Square Derivation for the Split-Split-Plot Design 



R 

Factor i 

F 

j 

b 

F 

k 

c 1 

F R 

h l 

Expected Mean Square 


T f 1 

a 

b 

c 1 

a 2 + abcdal 

Whole plot 

ft 

0 

b 

c 1 

, , rbc 2 ft 

' + + (a - 1) 


i 

0 

b 


a 2 + bcaip 


y* r 

a 

0 


-Syf 

a 2 + accr^y + 


(ry) ik 1 

a 

0 

c 1 

a 2 + acaiy 

Subplot 

(Py)jk r 

0 

0 

c 1 

rc'Z'Z (Py)% 


rjS y) ijk 1 

0 

0 

c 1 

a 2 + ca 2 ^ 


ft r 

a 

b 

0 1 

rab 2 ft 

a 2 + ab&rt + 

(c - 1) 


(rS) lh 1 

a 

b 

0 1 

a 2 + abais 


m Jh r 

0 

b 

0 1 

, , , 


(r m* i 

0 

b 

0 1 

a 2 + bai ps 

Sub-subplot 

(yS) kh r 

* 

0 

0 1 

ra'Z'Z (yS)l h 


(ry8) ikh 1 

a 

0 

0 1 

a 2 + aaiys 


(Py8)jkh r 

0 

0 

0 1 

, , . 

+ + (a - 1 )(b - l)(c - 1) 


(rpyS) IJkh 1 

0 

0 

0 1 

a 2 + cr^g 


e iwm 1 

1 

1 

l l 

a 2 (not estimable) 
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Whole plots | 



five replicates yield 2(5 - 1) = 8 error degrees of freedom, six replicates yield 2(6 - 
1) = 10 error degrees of freedom, seven replicates yield 2(7 — 1) = 12 error degrees of 
freedom, and so on. Consequently, we would probably not want to run fewer than four 
replicates because this would yield only four error degrees of freedom. Each additional 
replicate allows us to gain two degrees of freedom for error. If we could afford to run 
five replicates, we could increase the precision of the test by one-third (from six to eight 
degrees of freedom). Also, in going from five to six replicates, there is an additional 25 
percent gain in precision. If resources permit, the experimenter should run five or six 
replicates. 


13-5.3 The Strip-Split-Plot Design 

The strip-split-plot design has had extensive application in the agricultural sciences, but 
it finds occasional use in industrial experimentation. In the simplest case we have two 
factors A and B. Factor A is applied to whole plots just as in the standard split-plot design. 
Then factor B is applied to strips (which are really just another set of whole plots) that 


Table 13-19 An Abbreviated Analysis of Variance for a Strip-Split-Plot Design 

Source of 

Variation 

Sum of 
Squares 

Degrees of 

Freedom 

Expected Mean Square 

Replicates (or Blocks) 

SS ReplIcates 

r - 1 

cr\ + abai 

rb 2 P] 

A 

ss A 

a - 1 

+ bcrlp 4-j— 

Whole Plot Error 4 

ss WPa 

(r - l)(a - 1) 

<rl + balp 

2 2 ra^yl 

B 

ss b 

b — 1 

cr{ +acriy + —-— 

o—l 

Whole Plot Error e 

SS W p n 

(r - m - 1) 

a\ + aaly 

tSU (jP)% 

AB 

SS AB 

(a - IX* - 1) 

at+ {a - m - 1) 

Subplot Error 

SSsr 

(r ~ l)(a - l)(b - 1) 

oi 

Total 

ss, 

rab - 1 
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are orthogonal to the original whole plots used for factor A. Figure 13-9 (page 583) 
illustrates a situation in which both factors A and B have three levels. Note that the levels 
of factor A are confounded with the whole plots, and the levels of factor B are confounded 
with the strips (which can be thought of as a second set of whole plots). 

A model for the strip-split plot design in Figure 13-9, assuming y replicates, a levels 
of factor A, and b levels of factor B, is 


. r 

y iJk = /i. + T, + j8 j + (Tj8),y + y k + fry),* + U3y) jk + e Ijk i j = 1, 2,.. ., a 

[k = 1, 2,.... b 

where (r/3!),-,• and (ry) ik are whole plot errors for factors A and B, respectively, and e, jk is 
a “subplot” error, used to test the AB interaction. Table 13-19 (page 583) shows an 
abbreviated analysis of variance assuming A and B are fixed factors and replicates are 
random. The replicates are sometimes considered as blocks. 

13-6 PROBLEMS - 

13-1. A rocket propellant manufacturer is studying the burning rate of propellant from three 
production processes. Four batches of propellant are randomly selected from the output 
of each process and three determinations of burning rate are made on each batch. The 
results follow. Analyze the data and draw conclusions. 


Process 1 Process 2 Process 3 

Batch 123412341234 

25 19 15 15 19 23 18 35 14 35 38 25 

30 28 17 16 17 24 21 27 15 21 54 29 

26 20 14 13 14 21 17 25 20 24 50 33 


13-2. The surface finish of metal parts made on four machines is being studied. An experiment 
is conducted in which each machine is run by three different operators and two specimens 
from each operator are collected and tested. Because of the location of the machines, 
different operators are used on each machine, and the operators are chosen at random. 
The data are shown in the following table. Analyze the data and draw conclusions. 


Machine 1 


Machine 2 


Machine 3 

Machine 4 

Operator 1 2 

3 

1 2 

3 

1 2 3 

1 2 3 

79 94 

46 

92 85 

76 

88 53 46 

36 40 62 

62 74 

57 

99 79 

68 

75 56 57 

53 56 47 


13-3. A manufacturing engineer is studying the dimensional variability of a particular com¬ 
ponent that is produced on three machines. Each machine has two spindles, and four 
components are randomly selected from each spindle. The results follow. Analyze the 
data, assuming that machines and spindles are fixed factors. 



r 
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Machine 1 

Machine 2 

Machine 3 

Spindle 

1 

2 

1 

2 

1 

2 


12 

8 

14 

12 

14 

16 


9 

9 

15 

10 

10 

15 


11 

10 

13 

11 

12 

15 


12 

8 

14 

13 

11 

14 


13-4. To simplify production scheduling, an industrial engineer is studying the possibility of 
assigning one time standard to a particular class of jobs, believing that differences be¬ 
tween jobs are negligible. To see if this simplification is possible, six jobs are randomly 
selected. Each job is given to a different group of three operators. Each operator com¬ 
pletes the job twice at different times during the week, and the following results are 
obtained. What are your conclusions about the use of a common time standard for all 
jobs in this class? What value would you use for the standard? 


Job 

Operator 1 

Operator 2 

Operator 3 

1 

158.3 

159.4 

159.2 

159.6 

158.9 

157.8 

2 

154.6 

154.9 

157.7 

156.8 

154.8 

156.3 

3 

162.5 

162.6 

161.0 

158.9 

160.5 

159.5 

4 

160.0 

158.7 

157.5 

158.9 

161.1 

158.5 

5 

156.3 

158.1 

158.3 

156.9 

157.7 

156.9 

6 

163.7 

161.0 

162.3 

160.3 

162.6 

161.8 


13-5. Consider the three-stage nested design shown in Figure 13-5 to investigate alloy hard¬ 
ness. Using the data that follow, analyze the design, assuming that alloy chemistry and 
heats are fixed factors and ingots are random. Use the restricted form of the mixed model. 


Alloy Chemistry 

Heats 

1 



2 


1 

2 

3 1 

l 

2 

3 

Ingots 1 2 

1 2 

1 2 1 

2 

1 2 

1 2 

40 27 

95 69 

65 78 22 

23 

83 75 

61 35 

63 30 

67 47 

54 45 10 

39 

62 64 

77 42 


13-6. Reanalyze the experiment in Problem 13-5 using the unrestricted form of the mixed 
model. Comment on any differences you observe between the restricted and the unre¬ 
stricted model results. You may use a computer software package. 

13-7. Derive the expected mean squares for a balanced three-stage nested design, assuming 
that A is fixed and that B and C are random. Obtain formulas for estimating the variance 
components. Assume the restricted form of the mixed model. 

13-8. Repeat Problem 13-7 assuming the unrestricted form of the mixed model. You may use 
a computer software package to do this. Comment on any differences between the re¬ 
stricted and unrestricted model analysis and conclusions. 

13-9. Derive the expected mean squares for a balanced three-stage nested design if all three 
factors are random. Obtain formulas for estimating the variance components. 
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13-10. Verify the expected mean squares given in Table 13-1. 

13-11. Unbalanced nested designs. Consider an unbalanced two-stage nested design with b, 
levels of B under the ith level of A and n u replicates in the ijth cell. 

(a) Write down the least squares normal equations for this situation. Solve the normal 
equations. 

(b) Construct the analysis of variance table for the unbalanced two-stage nested design. 

(c) Analyze the following data, using the results in part (b): 


Factor A 1 _2_ 

Factor B _1_2_1_2_3_ 

6 —3 5 2 T 

4 17 4 0 

8 9 3 -3 

6 


13-12. Variance components in the unbalanced two-stage nested design. Consider the model 

{ / = 1, 2. a 

j = 1,2,..., bi 

k = 1, 2.no¬ 

where A and B are random factors. Show that 

E(MS a ) = o 2 + Ci a% + c 2 <r? 

E(MS bw ) = o 2 + Co&p 
E(MS e ) = o 2 

where 

N - 2 (X 




N - 2 "?• 
N 


13-13. A process engineer is testing the yield of a product manufactured on three machines. 

Each machine can be operated at two power settings. Furthermore, a machine has three 
stations on which the product is formed. An experiment is conducted in which each 
machine is tested at both power settings, and three observations on yield are taken from 
each station. The runs are made in random order, and the results follow. Analyze this 
experiment, assuming that all three factors are fixed. 
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Station 


Machine 1 


Machine 2 


Machine 3 

1 

2 

3 

1 

2 

3 

1 

2 

3 


34.1 

33.7 

36.2 

31.1 

33.1 

32.8 

32.9 

33.8 

33.6 

Power setting 1 

30.3 

34.9 

36.8 

33.5 

34.7 

35.1 

33.0 

33.4 

32.8 


31.6 

35.0 

37.1 

34.0 

33.9 

34.3 

33.1 

32.8 

31.7 


24.3 

28.1 

25.7 

24.1 

24.1 

26.0 

24.2 

23.2 

24.7 

Power setting 2 

26.3 

29.3 

26.1 

25.0 

25.1 

27.1 

26.1 

27.4 

22.0 


27.1 

28.6 

24.9 

26.3 

27.9 

23.9 

25.3 

28.0 

24.8 


13-14. Suppose that in Problem 13-13 a large number of power settings could have been used 
and that the two selected for the experiment were chosen randomly. Obtain the expected 
mean squares for this situation assuming the restricted form of the mixed model and 
modify the previous analysis appropriately. 

13-15. Reanalyze the experiment in Problem 13-14 assuming the unrestricted form of the mixed 
model. You may use a computer software package to do this. Comment on any differ¬ 
ences between the restricted and unrestricted model analysis and conclusions. 

13-16. A structural engineer is studying the strength of aluminum alloy purchased from three 
vendors. Each vendor submits the alloy in standard-sized bars of 1.0,1.5, or 2.0 inches. 
The processing of different sizes of bar stock from a common ingot involves different 
forging techniques, and so this factor may be important. Furthermore, the bar stock is 
forged from ingots made in different heats. Each vendor submits two test specimens of 
each size bar stock from three heats. The resulting strength data follow. Analyze the 
data, assuming that vendors and bar size are fixed and heats are random. Use the restricted 
form of the mixed model. 


Heat 


Vendor 1 


Vendor 2 


Vendor 3 


1 

2 

3 

1 

2 

3 

1 

2 

3 

Bar size: 1 inch 

1.230 

1.346 

1.235 

1.301 

1.346 

1.315 

1.247 

1.275 

1.324 


1.259 

1.400 

1.206 

1.263 

1.392 

1.320 

1.296 

1.268 

1.315 

I5 inch 

1.316 

1.329 

1.250 

1.274 

1.384 

1.346 

1.273 

1.260 

1.392 


1.300 

1.362 

1.239 

1.268 

1.375 

1.357 

1.264 

1.265 

1.364 

2 inch 

1.287 

1.346 

1.273 

1.247 

1.362 

1.336 

1.301 

1.280 

1.319 


1.292 

1.382 

1.215 

1.215 

1.328 

1.342 

1.262 

1.271 

1.323 


13-17. Rework Problem 13-16 using the unrestricted form of the mixed model. You may use a 
computer software package to do this. Comment on any differences between the re¬ 
stricted and unrestricted model analysis and conclusions. 

13-18. Suppose that in Problem 13-16 the bar stock may be purchased in many sizes and that 
the three sizes actually used in the experiment were selected randomly. Obtain the ex¬ 
pected mean squares for this situation and modify the previous analysis appropriately. 
Use the restricted form of the mixed model. 

13-19. Steel is normalized by heating above the critical temperature, soaking, and then air 
cooling. This process increases the strength of the steel, refines the grain, and homoge¬ 
nizes the structure. An experiment is performed to determine the effect of temperature 
and heat treatment time on the strength of normalized steel. Two temperatures and three 
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times are selected. The experiment is performed by heating the oven to a randomly 
selected temperature and inserting three specimens. After 10 minutes one specimen is 
removed, after 20 minutes the second is removed, and after 30 minutes the final specimen 
is removed. Then the temperature is changed to the other level and the process is repeated. 
Four shifts are required to collect the data, which are shown below. Analyze the data 
and draw conclusions, assuming both factors are fixed. 




Temperature, °F 

Shift 

Time, minutes 

1500 

1600 


10 

63 

89 

1 

20 

54 

91 


30 

61 

62 


10 

50 

80 

2 

20 

52 

72 


30 

59 

69 


10 

48 

73 

3 

20 

74 

81 


30 

71 

69 


10 

54 

88 

4 

20 

48 

92 


30 

59 

64 


13-20. An experiment is designed to study pigment dispersion in paint. Four different mixes of 
a particular pigment are studied. The procedure consists of preparing a particular mix 
and then applying that mix to a panel by three application methods (brushing, spraying, 
and rolling). The response measured is the percentage reflectance of pigment. Three days 
are required to run the experiment, and the data obtained follow. Analyze the data and 
draw conclusions, assuming that mixes and application methods are fixed. 


Day 

Application 

Method 


Mix 



1 

2 

3 

4 


1 

64.5 

66.3 

74.1 

66.5 

1 

2 

68.3 

69.5 

73.8 

70.0 


3 

70.3 

73.1 

78.0 

72.3 


1 

65.2 

65.0 

73.8 

64.8 

2 

2 

69.2 

70.3 

74.5 

68.3 


3 

71.2 

72.8 

79.1 

71.5 


1 

66.2 

66.5 

72.3 

67.7 

3 

2 

69.0 

69.0 

75.4 

68.6 


3 

70.8 

74.2 

80.1 

72.4 


13-21. Repeat Problem 13-20, assuming that the mixes are random and the application methods 
are fixed. 

Consider the split-split-plot design described in Example 13-3. Suppose that this exper¬ 
iment is conducted as described and that the data shown in the following table are 
obtained. Analyze the data and draw conclusions. 


13-22. 



13-6 PROBLEMS 
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Technician 




Replicates 
(or Blocks) 



1 



2 



3 


Dosage Strengths 

1 

2 

3 

1 

2 

3 

1 

2 

3 


Wall Thickness 

1 

95 

71 

108 

96 

70 

108 

95 

70 

100 

1 

2 

104 

82 

115 

99 

84 

100 

102 

81 

106 


3 

101 

85 

117 

95 

83 

105 

105 

84 

113 


4 

108 

85 

116 

97 

85 

109 

107 

87 

115 


1 

95 

78 

110 

100 

72 

104 

92 

69 

101 

2 

2 

106 

84 

109 

101 

79 

102 

100 

76 

104 


3 

103 

86 

116 

99 

80 

108 

101 

80 

109 


4 

109 

84 

110 

112 

86 

109 

108 

86 

113 


1 

96 

70 

107 

94 

66 

100 

90 

73 

98 

3 

2 

105 

81 

106 

100 

84 

101 

97 

75 

100 


3 

106 

88 

112 

104 

87 

109 

100 

82 

104 


4 

113 

90 

117 

121 

90 

117 

110 

91 

112 


1 

90 

68 

109 

98 

68 

106 

98 

72 

101 

4 

2 

100 

84 

112 

102 

81 

103 

102 

78 

105 


3 

102 

85 

115 

100 

85 

110 

105 

80 

110 


4 

114 

88 

118 

118 

85 

116 

110 

95 

120 


13-23. Rework Problem 13-22, assuming that the technicians are chosen at random. Use the 
restricted form of the mixed model. 

13-24. Suppose that in Problem 13-22 four technicians had been used. Assuming that all the 
factors are fixed, how many blocks should be run to obtain an adequate number of degrees 
of freedom on the test for differences among technicians? 

13-25. Consider the experiment described in Example 13-3. Demonstrate how the order in which 
the treatment combinations are run would be determined if this experiment were run as 
(a) a split-split-plot, (b) a split-plot, (c) a factorial design in a randomized block, and 
(d) a completely randomized factorial design. 



Other Design 
and Analysis 
Topics _ 


The subject of statistically designed experiments is an extensive one. The previous chap¬ 
ters have provided an introductory presentation of many of the basic concepts and meth¬ 
ods, yet in some cases, we have only been able to provide an overview. For example, 
there are book-length presentations of topics such as response surface methodology, 
mixture experiments, variance component estimation, and optimal design. In this chapter 
we provide an overview of several other topics that the experimenter may potentially 
find useful. 


14-1 NONNORMAL RESPONSES AND TRANSFORMATIONS 

14-1.1 Selecting a Transformation: The Box-Cox Method 

In Section 3-4.3 we discussed the problem of nonconstant variance in the response vari¬ 
able y from a designed experiment and noted that this is a departure from the standard 
analysis of variance assumptions. This inequality of variance problem occurs relatively 
often in practice, often in conjunction with a nonnormal response variable. Examples 
would include a count of defects or particles, proportion data such as yield or fraction 
defective, or a response variable that follows some skewed distribution (one “tail” of 
the response distribution is longer than the other). We introduced transformation of the 
response variable as an appropriate method for stabilizing the variance of the response. 
Two methods for selecting the form of the transformation were discussed, an empirical 
graphical technique and essentially trial-and-error in which the experimenter simply tries 
one or more transformations, selecting the one that produces the most satisfactory or 
pleasing plot of residuals versus the fitted response. 

Generally, transformations are used for three purposes; stabilizing response variance, 
makin g the distribution of the response variable closer to the normal distribution, and 
improving the fit of the model to the data. This last objective could include model 
simplification, say by eliminating interaction terms. Sometimes a transformation will 
590 
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be reasonably effective in simultaneously accomplishing more than one of these 
objectives. 

We have noted that the power family of transformations y* = y A is very useful, 
where A is the parameter of the transformation to be determined (e.g., A = 5 means use 
the square root of the original response). Box and Cox (1964) have shown how the 
transformation parameter A may be estimated simultaneously with the other model pa¬ 
rameters (overall mean and treatment effects). The theory underlying their method uses 
the method of maximum likelihood. The actual computational procedure consists of 
performing, for various values of A, a standard analysis of variance on 

\ y A ¥= 0 

y w = A y (14-1) 

y In y A = 0 


where y = ln _ 1 [(l/«) S In y] is the geometric mean of the observations. The maximum 
likelihood estimate of A is the value for which the error sum of squares, say SS E ( A), is 
a minimum. This value of A is usually found by plotting a graph of SS E ( A) versus A and 
then reading the value of A that minimizes SS E ( A) from the graph. Usually between 10 
and 20 values of A are sufficient for estimation of the optimum value. A second iteration 
using a finer mesh of values could be performed if a more accurate estimate of A is 
necessary. 

Notice that we cannot select the value of A by directly comparing the error sums of 
squares from analyses of variance on y A because for each value of A the error sum of 
squares is measured on a different scale. Furthermore, a problem arises in y when A = 
0; namely, as A approaches zero, y A approaches unity . 5 That is, when A = 0, all the 
response values are a constant. The component (y A — 1)/A of Equation 14-1 alleviates 
this problem because as A tends to zero, (y A — 1)/A goes to a limit of In y. The divisor 
component y A ~ 1 in Equation 14-1 rescales the responses so that the error sums of squares 
are directly comparable. 

In using the Box-Cox method, we recommend that the experimenter use simple 
choices for A because the practical difference between A = 0.5 and A = 0.58 is likely to 
be small, but the square root transformation (A = 0.5) is much easier to interpret. Ob¬ 
viously, values of A close to unity would suggest that no transformation is necessary. 

Once a value of A is selected by the Box-Cox method, the experimenter can analyze 
the data using y A as the response, unless of course A = 0, in which he can use In y. It is 
perfectly acceptable to use y (A) as the actual response, although the model parameter 
estimates will have a scale difference and origin shift in comparison to the results ob¬ 
tained using y A (or In y). 

An approximate 100(1 - a) percent confidence interval for A can be found by 
computing 


SS* = SS E ( A) 



(14-2) 


where v is the number of degrees of freedom, and plotting a line parallel to the A axis 
at height SS* on the graph of SS E ( A) versus A. Then by locating the points on the A axis 
where SS* cuts the curve SS £ (A), we can read confidence limits on A directly from the 
graph. If this confidence interval includes the value A = 1, this implies (as noted above) 
that the data do not support the need for transformation. 
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EXAMPLE 14-1 . 

We will illustrate the Box-Cox procedure, using the peak discharge data originally pre¬ 
sented in Example 3-5. Recall that this is a single-factor experiment (see Table 3-7 for 
the original data). Using Equation 14-1, we computed values of SS E ( A) for various values 
of A: 


A 

SS E ( A) 

-1.00 

7922.11 

-0.50 

687.10 

-0.25 

232.52 

0.00 

91.96 

0.25 

46.99 

0.50 

35.42 

0.75 

40.61 

1.00 

62.08 

1.25 

109.82 

1.50 

208.12 


A graph of values close to the minimum is shown in Figure 14-1 (on the facing page), 
from which it is seen that A — 0.52 gives a minimum value of approximately SS E (\) = 
35.00. An approximate 95 percent confidence interval on A is found by calculating the 
quantity SS* from Equation 14-2 as follows: 


SS* = SS E ( A) I 



= 42.61 


By plotting SS* on the graph in Figure 14-1 and by reading the points on the A scale 
where this line intersects the curve, we obtain lower and upper confidence limits on A of 
A = 0.27 and A + = 0.77. Because these confidence limits do not include the value 1, 
the use of a transformation is indicated, and the square root transformation (A = 0.50) 
actually used is easily justified. 


Some computer programs include the Box-Cox procedure for selecting a power 
family transformation. Figure 14-2 (on the facing page) presents the output from this 
procedure as implemented in Design-Expert for the peak discharge data. The results agree 
closely with the manual calculations summarized in Example 14-1. Notice that the ver¬ 
tical scale of the graph in Figure 14-2 is ln[SS £ (A)]. 
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Ar = 0.27 X* = 0.77 

Figure 14-1 Plot of SS E ( A) versus A for Example 14-1. 


DESIGN-EXPERT Plot 
Peak Discharge 

Lambda 
Current =1 
Best = 0.541377 
Low C.l. = 0.291092 
High C.l. = 0.791662 
Recommend 
transformation 
Square Root 
(Lambda = 0.5) 



-3-2-10123 


Lambda 

Figure 14-2 Output from Design-Expert for the Box-Cox procedure. 
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14-1.2 The Generalized Linear Model 

Data transformations are often a very effective way to deal with the problem of nonnor¬ 
mal responses and the associated inequality of variance. As we have seen in the previous 
section, the Box-Cox method is an easy and effective way to select the form of the 
transformation. However, there can be problems associated with the use of a data 
transformation. 

One problem is that the experimenter may be uncomfortable in working with the 
response in the transformed scale. That is, she is interested in the number of defects, not 
the square root of the number of defects, or in resistivity instead of the logarithm of 
resistivity. On the other hand, if a transformation is really successful and improves the 
analysis and the associated model for the response, experimenters will usually quickly 
adopt the new metric. 

A more serious problem is that a transformation can result in a nonsensical value 
for the response variable over some portion of the design factor space that is of interest 
to the experimenter. For example, suppose that we have used the square root transfor¬ 
mation in an experiment involving the number of defects observed on semiconductor 
wafers, and for some portion of the region of interest the predicted square root of the 
count of defects is negative. This is likely to occur for situations where the actual number 
of defects observed is small. Consequently, the model for the experiment has produced 
an obviously unreliable prediction in the very region where we would like this model to 
have good predictive performance. 

Finally, as noted in Section 14-1.1, we often use transformations to stabilize variance, 
induce normality, and simplify the model. There is no assurance that a transformation 
will effectively accomplish all of these objectives simultaneously. 

An alternative to the typical approach of data transformation followed by standard 
least squares analysis of the transformed response is to use the generalized linear model. 
This is an approach developed by Nelder and Wedderbum (1972) that essentially unifies 
linear and nonlinear models with both normal and nonnormal responses. McCullagh and 
Nelder (1989) give a comprehensive presentation of generalized linear models, and 
Myers and Montgomery (1997) provide a tutorial. More details are also given in the 
supplemental text material for this chapter. We will provide an overview of the concepts 
and illustrate them with two short examples. 

A generalized linear model is basically a regression model (an experimental design 
model is also a regression model). Like all regression models, it is made up of a random 
component (what we have usually called the error term) and a function of the design 
factors (the x’s) and some unknown parameters (the /3’s). In a standard normal-theory 
linear regression model we write 

y = fio + /Si*! + P 2 X 2 + • • • + P k *k + € (14-3) 

where the error term e is assumed to have a normal distribution with mean zero and 
constant variance, and the mean of the response variable y is 

E(y) = fi = j8 0 + PiXt + P 2 X 2 + ■ • • + f$kX k = x'/3 (14-4) 

The portion x'/3 of Equation (14-4) is called the linear predictor. The generalized linear 
model contains Equation (14-3) as a special case. 

In a generalized linear model the response variable can have any distribution that is 
a member of the exponential family. This family includes the normal, Poisson, bino- 
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mial, exponential, and gamma distributions, so the exponential family is a very rich 
and flexible collection of distributions applicable to many experimental situations. Also, 
the relationship between the response mean /a and the linear predictor x'/i is determined 
by a link function. 

gin) = x'/3 (14-5) 

The regression model that represents the mean response is then given by 

E(y) = fi = g~\x'P) (14-6) 

For example, the link function leading to the ordinary linear regression model in Equation 
(14-3) is called the identity link, because fi = g _1 (x'/3) = x'/3. As another example, 
the log link 

InO*) = x'/3 (14-7) 

produces the model 

H = e x ' p (14-8) 

The log link is often used with count data (Poisson response) and with continuous re¬ 
sponses that have a distribution that has a long tail to the right (the exponential or gamma 
distribution). Another important link function used with binomial data is the logit link 

ln (l^) = (14 ' 9) 

This choice of link function leads to the model 

f. - ^4-, (14-10) 

There are many possible choices of link function, but it must always be monotonic and 
differentiable. Also note that in a generalized linear model, the variance of the response 
variable does not have to be a constant; it can be a function of the mean (and the predictor 
variables through the link function). For example, if the response is Poisson, the variance 
of the response is exactly equal to the mean. 

To use a generalized linear model in practice, the experimenter must specify a re¬ 
sponse distribution and a link function. Then the model fitting or parameter estimation 
is done by the method of maximum likelihood, which for the exponential family turns 
out to be an iterative version of weighted least squares. For ordinary linear regression or 
experimental design models with a normal response variable, this reduces to standard 
least squares. Using an approach that is analogous to the analysis of variance for normal- 
theory data, we can perform inference and diagnostic checking for a generalized linear 
model. Refer to Myers and Montgomery (1997) for the details and examples. Two 
software packages that support the generalized linear model nicely are SAS (PROC 
GENMOD) and S-PLUS. 

EXAMPLE 14-2 . 

The Grill Defects Experiment 

Problem 8-29 introduced an experiment to study the effects of nine factors on defects in 
sheet-molded grill opening panels. Bisgaard and Fuller (1994-1995) performed an in¬ 
teresting and useful analysis of this data to illustrate the value of data transformation in 



Table 14-1 Least Squares and Generalized Linear Model Analysis for the Grill Opening Panels Experi 
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(2.15,4.86) 11.78 (4.13,23.10) 8.54 (5.62,12.98) 

(-0.28,2.43) 0.71 (*, 5.41) 0.81 (0.42, 1.56) 

(-1.82,0.89) * (*,0.36) 0.19 (0.09,0.38) 
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a designed experiment. As we observed in Problem 8-29 part (f), they used a modification 
of the square root transformation that led to the model 

(VfW^TT)/2 = 2.513 - 0.996x4 - 1.21x 6 - 0.772x 2 x 7 

where as usual, the x’s represent the coded design factors. This transformation does an 
excellent job of stabilizing the variance of the number of defectives. The first two panels 
of Table 14-1 (page 596) present some information about this model. Under the “Trans¬ 
formed” heading, the first column contains the predicted response. Notice that there are 
two negative predicted values. The “untransformed” heading presents the untransformed 
predicted values, along with 95 percent confidence intervals on the mean response at 
each of the 16 design points. Because there were some negative predicted values, and 
negative lower confidence limits, we were unable to compute values for all of the entries 
in this panel of the table. 

The response is essentially a square root of the count of defects. A negative predicted 
value is clearly illogical. Notice that this is occurring where the observed counts were 
small. If it is important to use the model to predict performance in this region, the model 
may be unreliable. This should not be taken as criticism of either the original experi¬ 
menters or Bisgaard and Fuller’s analysis. This was an extremely successful screening 
experiment that clearly defined the important processing variables. Prediction was not 
the original goal, nor was it the goal of the analysis done by Bisgaard and Fuller. 

However, if obtaining a prediction model had been important, a generalized linear 
model would probably have been a good alternative to the transformation approach. 
Myers and Montgomery use a log link [Equation (14-7)] and Poisson response to fit 
exactly the same linear predictor given by Bisgaard and Fuller. This produces the model 

y = g(l-128-0.896* 4 - 1.176^6-0.737^,) 

The third panel of Table 14-1 contains the predicted values from this model and the 
95 percent confidence intervals on the mean response at each point in the design (obtained 
from SAS PROC GENMOD). There are no negative predicted values (assured by the 
choice of link function) and no negative lower confidence limits. The last panel of the 
table compares the lengths of the 95 percent confidence intervals for the untransformed 
response and the generalized linear model (GLM). Notice that the confidence intervals 
for the generalized linear model are uniformly shorter than their least squares counter¬ 
parts. This is a strong indication that the generalized linear model approach has explained 
more variability and produced a superior model compared to the transformation approach. 


EXAMPLE 14-3 . 

The Worsted Yarn Experiment 

Table 14-2 (page 598) presents a 3 3 factorial design conducted to investigate the perfor¬ 
mance of worsted yam under cycles of repeated loading. The experiment is described 
thoroughly by Box and Draper (1987). The response is the number of cycles to failure. 
Reliability data such as this is typically nonnegative and continuous and often follows a 
distribution with a long right tail. 

The data was initially analyzed using the standard (least squares) approach, and data 
transformation was necessary to stabilize the variance. The log of the cycles to failure 
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Table 14-2 The Worsted Yam Experiment 


Run 


x 2 

*3 

Cycles to 
Failure 

Log 

Cycles to 
Failure 

1 

-1 

-1 

-1 

674 

2.83 

2 

-1 

-1 

0 

370 

2.57 

3 

-1 

-1 

1 

292 

2.47 

4 

-1 

0 

-1 

338 

2.53 

5 

-1 

0 

0 

266 

2.42 

6 

-1 

0 

1 

210 

2.32 

7 

-1 

1 

-1 

170 

2.23 

8 

-1 

1 

0 

118 

2.07 

9 

-1 

1 

1 

90 

1.95 

10 

0 

-1 

-1 

1414 

3.15 

11 

0 

-1 

0 

1198 

3.08 

12 

0 

-1 

1 

634 

2.8 

13 

0 

0 

-1 

1022 

3.01 

14 

0 

0 

0 

620 

2.79 

15 

0 

0 

1 

438 

2.64 

16 

0 

1 

-1 

442 

2.65 

17 

0 

1 

0 

332 

2.52 

18 

0 

1 

1 

220 

2.34 

19 

1 

-1 

-1 

3636 

3.56 

20 

1 

-1 

0 

3184 

3.5 

21 

1 

-1 

1 

2000 

3.3 

22 

1 

0 

-1 

1568 

3.19 

23 

1 

0 

0 

1070 

3.03 

24 

1 

0 

1 

566 

2.75 

25 

1 

1 

-1 

1140 

3.06 

26 

1 

1 

0 

884 

2.95 

27 

1 

1 

1 

360 

2.56 


data is found to yield an adequate model in terms of overall model fit and satisfactory 
residual plots. The model is 

log y = 2.751 + 0.3617*! - 0.2739x 2 - 0.1711*3 
or in terms of the original response, cycles to failure, 

- _ jq 2.751 +0.3617x, -0.2739x 2 -0.171 l* 3 

This experiment was also analyzed using the generalized linear model, selecting 
the gamma response distribution and the log link. We used exactly the same model 
form found by least squares analysis of the log-transformed response. The model that 
resulted is 

y = ^6.3489+0.8425^:,-0.6313^2-0.3851x3 

Table 14-3 (on the facing page) presents the predicted values from the least squares 
model and the generalized linear model, along with 95 percent confidence intervals on 
the mean response at each of the 27 points in the design. A comparison of the lengths 



Table 14-3 Least Squares and Generalized Linear Model Analysis for the Worsted Yam Exper 
Least Squares Methods with Log Data Transformation 
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of the confidence intervals reveals that the generalized linear model is likely to be a better 
predictor than the least squares model. 


Generalized linear models have found extensive application in biomedical and phar¬ 
maceutical research and development. As more software packages incorporate this ca¬ 
pability, it will find widespread application in the general industrial research and 
development environment. 


14-2 UNBALANCED DATA IN A FACTORIAL DESIGN 

The primary focus of this book has been the analysis of balanced factorial designs, that 
is, cases where there are an equal number of observations n in each cell. However, it is 
not unusual to encounter situations where the number of observations in the cells are 
unequal. These unbalanced factorial designs occur for various reasons. For example, 
the experimenter may have designed a balanced experiment initially, but because of 
unforeseen problems in running the experiment, resulting in the loss of some observa¬ 
tions he ends up with unbalanced data. On the other hand, some unbalanced experiments 
are deliberately designed that way. For instance, certain treatment combinations may be 
more expensive or more difficult to run than others, so fewer observations may be taken 
in those cells. Alternatively, some treatment combinations may be of greater interest to 
the experimenter because they represent new or unexplored conditions, so may elect to 
obtain additional replication in those cells. 

The orthogonality property of main effects and interactions present in balanced data 
does not carry over to the unbalanced case. This means that the usual analysis of variance 
techniques do not apply. Consequently, the analysis of unbalanced factorials is much 
more difficult than that for balanced designs. 

In this section we give a brief overview of methods for dealing with unbalanced 
factorials, concentrating on the case of the two-factor fixed effects model. Suppose that 
the number of observations in the yth cell is n tJ , Furthermore, let n L - 2, =1 n tj be the 
number of observations in the ith row (the ith level of factor A), n.j = 2?=i n tj be the 
number of observations in the yth column (the yth level of factor B), and n. - 
2" =1 2*=i n.j be the total number of observations. 


14-2.1 Proportional Data: An Easy Case 

One situation involving unbalanced data presents little difficulty in analysis; this is the 
case of proportional data. That is, the number of observations in the yth cell is 


(14-11) 


This condition implies that the number of observations in any two rows or columns are 
proportional. When proportional data occur, the standard analysis of variance can be 
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employed. Only minor modifications are required in the manual computing formulas for 
the sums of squares, which become 


SS T 

SS A 

SS B 

SSab 


= ii 

-ii 

-IS 

= ii 



yi 


n_ 

yt 

n 


y± _ >d 

n.jj n„ 


- SS A - SS B 


ss E = ss T - ss A - ss B - SS AB 





yf t . 

n ij 


As an example of proportional data, consider the battery design experiment in Ex¬ 
ample 5-1. A modified version of the original data is shown in Table 14-4. Clearly, the 
data are proportional; for example, in cell 1,1 we have 




WiAi _ 10(8) 

n. 20 


observations. The results of applying the usual analysis of variance to these data are 
shown in Table 14-5 on the next page. Both material type and temperature are significant, 
which agrees with the analysis of the full data set in Example 5-1. However, the inter¬ 
action noted in Example 5-1 is not present. 


14-2.2 Approximate Methods 

When unbalanced data are not too far from the balanced case, it is sometimes possible 
to use approximate procedures that convert the unbalanced problem into a balanced 


Table 14-4 Battery Design Experiment with Proportional Data 





Temperature, °F 



Type 

15 


70 


125 



"u =4 


n l2 = 4 


W 13 = 2 

72, = 10 


130 

155 

34 

40 



1 

74 

180 

80 

75 

70 58 

y,.. = 896 


« 2 | = 2 


«22 = 2 


7223 = 1 

722. = 5 

2 

159 

126 

136 

115 

45 

y 2 .. = 581 


n 3l = 2 


«32 = 2 


«33 = 1 

723. = 5 

3 

138 

160 

150 

139 

96 

yj.. = 683 


n x = 8 


72.2 = 8 


72.3 = 4 

72.. = 20 


y.i. = 1122 


y.2. = 769 


y.3. = 269 

y... = 2160 
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Table 14-5 Analysis of Variance for Battery Design Data 
in Table 14-4 _ 


Source of 
Variance 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

F 0 

Material types 

8,170.400 

2 

4,085.20 

5.00 

Temperature 

16,090.875 

2 

8,045.44 

9.85 

Interaction 

5,907.725 

4 

1,476.93 

1.18 

Error 

8,981.000 

11 

816.45 


Total 

39,150.000 

19 




one. This of course makes the analysis only approximate, but the analysis of balanced 
data is so easy that we are frequently tempted to use this approach. In practice, we must 
decide when the data are not sufficiently different from the balanced case to make the 
degree of approximation introduced relatively unimportant. We now briefly describe 
some of these approximate methods. We assume that every cell has at least one obser¬ 
vation (i.e., riij > 1). 

Estimating Missing Observations 

If only a few n u are different, a reasonable procedure is to estimate the missing values. 
For example, consider the unbalanced design in Table 14-6. Clearly, estimating the single 
missing value in cell 2,2 is a reasonable approach. For a model with interaction, the 
estimate of the missing value in the ijth cell that minimizes the error sum of squares is 
y u . That is, we estimate the missing value by taking the average of the observations that 
are available in that cell. 

The estimated value is treated just like actual data. The only modification to the 
analysis of variance is to reduce the error degrees of freedom by the number of missing 
observations that have been estimated. For example, if we estimate the missing value in 
cell 2,2 in Table 14-6, we would use 26 error degrees of freedom instead of 27. 

Setting Data Aside 

Consider the data in Table 14-7 on the facing page. Note that cell 2,2 has only one more 
observation than the others. Estimating missing values for the remaining eight cells is 
probably not a good idea here because this would result in estimates constituting about 
18 percent of the final data. An alternative is to set aside one of the observations in cell 
2,2, giving a balanced design with n = 4 replicates. 

The observation that is set aside should be chosen randomly. Furthermore, rather 
than completely discarding the observation, we could return it to the design, then ran¬ 
domly choose another observation to set aside and repeat the analysis. And, we hope, 

Table 14-6 The n,. Values for an 
Unbalanced Design 
Columns 
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Table 14-7 The n tj Values for an 
Unbalanced Design 
Columns 

Rows 12 3 

1 4 4 4 

2 4 5 4 

3 4 4 4 


these two analyses will not lead to conflicting interpretations of the data. If they do, we 
suspect that the observation that was set aside is an outlier or a wild value and should 
be handled accordingly. In practice, this confusion is unlikely to occur when only small 
numbers of observations are set aside and the variability within the cells is small. 


Method of Unweighted Means 

In this approach, introduced by Yates (1934), the cell averages are treated as data and 
are subjected to a standard balanced data analysis to obtain sums of squares for rows, 
columns, and interaction. The error mean square is found as 

III (yuk ~ yu.) 2 

MS e = ' ly ~‘ * 1 --- (14-12) 

n.. ~ ab 

Now MS e estimates <x 2 , the variance of y ijk , an individual observation. However, we have 
done an analysis of variance on the cell averages, and because the variance of the average 
in the ijth cell is o- 2 /n,j, the error mean square actually used in the analysis of variance 
should be an estimate of the average variance of the y^, say 


2 2 0- 2 /«y 


V(y tJ .) = 


ab 


= ^ 22 : 


Using MS/ from Equation 14-12 to estimate <x 2 


ab Yi/j 

i Equation 14-13, we obtain 


(14-13) 


“x-ir&iz 


as the error mean square (with n_ — ab degrees of freedom) to use in the analysis of 
variance. 

The method of unweighted means is an approximate procedure because the sums of 
squares for rows, columns, and interaction are not distributed as chi-square random vari¬ 
ables. The primary advantage of the method seems to be its computational simplicity. 
When the n 0 are not dramatically different, the method of unweighted means often works 
reasonably well. 

A related technique is the weighted squares of means method, also proposed by 
Yates (1934). This technique is also based on the sums of squares of the cell means, but 
the terms in the sums of squares are weighted in inverse proportion to their variances. 
For further details of the procedure, see Searle (1971a) and Speed, Hocking, and Hackney 
(1978). 
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14-2.3 The Exact Method 

In situations where approximate methods are inappropriate, such as when empty cells 
occur (some n tJ = 0) or when the n u are dramatically different, the experimenter must 
use an exact analysis. The approach used to develop sums of squares for testing main 
effects and interactions is to represent the analysis of variance model as a regression 
model, fit that model to the data, and use the general regression significance test ap¬ 
proach. However, there are several ways that this may be done, and these methods may 
result in different values for the sums of squares. Furthermore, the hypotheses that are 
being tested are not always direct analogs of those for the balanced case, nor are they 
always easily interpretable. For further reading on the subject, see the supplemental text 
material for this chapter. Other good references are Searle (1971a); Speed and Hocking 
(1976); Hocking and Speed (1975); Hocking, Hackney, and Speed (1978); Speed, Hock¬ 
ing, and Hackney (1978); Searle, Speed, and Henderson (1981); Searle (1987); and Mil- 
liken and Johnson (1984). The SAS system of statistical software provides an excellent 
approach to the analysis of unbalanced data through PROC GLM. 


14-3 THE ANALYSIS OF COVARIANCE 

In Chapters 2 and 4 we introduced the use of the blocking principle to improve the 
precision with which comparisons between treatments are made. The paired t test was 
the procedure illustrated in Chapter 2 and the randomized block design was presented in 
Chapter 4. In general, the blocking principle can be used to eliminate the effect of 
controllable nuisance factors. The analysis of covariance is another technique that is 
occasionally useful for improving the precision of an experiment. Suppose that in an 
experiment with a response variable y there is another variable, say x, and that y is linearly 
related to x. Furthermore, suppose that x cannot be controlled by the experimenter but 
can be observed along with y. The variable x is called a covariate or concomitant 
variable. The analysis of covariance involves adjusting the observed response variable 
for the effect of the concomitant variable. If such an adjustment is not performed, the 
concomitant variable could inflate the error mean square and make true differences in 
the response due to treatments harder to detect. Thus, the analysis of covariance is a 
method of adjusting for the effects of an uncontrollable nuisance variable. As we will 
see, the procedure is a combination of analysis of variance and regression analysis. 

As an example of an experiment in which the analysis of covariance may be em- 


Table 14-8 Breaking Strength Data (y = strength in pounds 
and x = diameter in 1CT 3 inches) 


Machine 1 Machine 2 Machine 3 


36 

41 
39 

42 
49 

207 


20 

25 

24 

25 
32 

126 


40 

48 

39 

45 

44 

216 


22 

28 

22 

30 

28 

130 


y 

35 

37 

42 

34 

32 

180 


21 

23 

26 

21 

J5 

106 
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ployed, consider a study performed to determine if there is a difference in the strength 
of a monofilament fiber produced by three different machines. The data from this ex¬ 
periment are shown in Table 14-8 (page 604). Figure 14-3 presents a scatter diagram of 
strength (y) versus the diameter (or thickness) of the sample. Clearly the strength of the 
fiber is also affected by its thickness; consequently, a thicker fiber will generally be 
stronger than a thinner one. The analysis of covariance could be used to remove the effect 
of thickness (x) on strength (y) when testing for differences in strength between machines. 


14-3.1 Description of the Procedure 

The basic procedure for the analysis of covariance is now described and illustrated for a 
single-factor experiment with one covariate. Assuming that there is a linear relationship 
between the response and the covariate, an appropriate statistical model is 

y tt “ /* + t, + p&j - xj + €ij |‘ = j’ 2 « (14-15) 

where y, y is the yth observation on the response variable taken under the ith treatment or 
level of the single factor, x tJ is the measurement made on the covariate or concomitant 
variable corresponding to y :J (i.e., the ijth run), x. is the mean of the x, y values, /x is an 
overall mean, t, is the effect of the ith treatment, /3 is a linear regression coefficient 
indicating the dependency of y, ; on x, y , and e, y is a random error component. We assume 
that the errors e, y are NID(0, a 2 ), that the slope /? A 0 and the true relationship between 
y, y and x :J is linear, that the regression coefficients for each treatment are identical, that 
the treatment effects sum to zero (2? =1 r, = 0), and that the concomitant variable x, y is 
not affected by the treatments. 

Notice from Equation 14-15 that the analysis of covariance model is a combination 
of the linear models employed in analysis of variance and regression. That is, we have 
treatment effects { t, } as in a single-factor analysis of variance and a regression coefficient 
ft as in a regression equation. The concomitant variable in Equation 14-15 is expressed 
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as {Xjj -x) instead of x tJ so that the parameter /a is preserved as the overall mean. The 
model could have been written as 

y.j = /*' + T - + P X ‘J + e v {j = 1,2,..., n (14 ' 16) 

where fx' is a constant not equal to the overall mean, which for this model is jx + fix... 
Equation 14-15 is more widely found in the literature. 

To describe the analysis, we introduce the following notation: 


= ± X (Xij - *_Xy„ - yj = i X Xijy.j - 

T„ = n± {y,.-yf = l -2y;.- y ^ n 

T xy = «X Ok - *..)(?,. - y..> = J X C*J(yJ _ 

£ xv = x X (yu - yj 2 = s yy ~ J yy 

E xx = X X (*« - ^.) 2 = - T xx 

E xy = X X (*</ - ^.)(yy - y.0 = s v - r xy 


WOO 


(xj(yj 


(14-17) 

(14-18) 

(14-19) 

(14-20) 

(14-21) 

(14-22) 

(14-23) 

(14-24) 

(14-25) 


Note that, in general, S = T + E, where the symbols S, T, and E are used to denote sums 
of squares and cross-products for total, treatments, and error, respectively. The sums of 
squares for jc and y must be nonnegative; however, the sums of cross-products (xy) may 

be negative. . ,, t 

We now show how the analysis of covariance adjusts the response variable for the 
effect of the covariate. Consider the full model (Equation 14-15). The least squares 
estimators of /x, t„ and j8 are (x = y„, t, = y«. “ y_ " $(*/. “ *..)> and 

0 = h. (14-26) 

The error sum of squares in this model is 

SS E = E yy - (E xy flE xx (14-27) 

with a(n - 1) - 1 degrees of freedom. The experimental error variance is estimated by 

SS E 

MS E = "7- ^ -7 

a(n - 1 ) - 1 
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Now suppose that there is no treatment effect. The model (Equation 14-15) would then 
be 


y,i = fi + P(x,j - xj + e„ 


(14-28) 


and it can be shown that the least squares estimators of /x and /3 are /i = y__ and jS = 
SJS XX . The sum of squares for error in this reduced model is 

SS'e = S„ ~ (S xy ) 2 /S xx (14-29) 

with an-2 degrees of freedom. In Equation 14-29, the quantity (S xy ) 2 /S xx is the reduction 
in the sum of squares of y obtained through the linear regression of y on x. Furthermore, 
note that SS E is smaller than SS E [because the model (Equation 14-15) contains additional 
parameters {r, }] and that the quantity SS E — SS E is a reduction in sum of squares due to 
the {T/}. Therefore, the difference between SS E and SS E , that is, SS E — SS E , provides a 
sum of squares with a — l degrees of freedom for testing the hypothesis of no treatment 
effects. Consequently, to test H 0 : t, = 0, compute 


(SSk ~ SS E )/(a - 1) 
SS E /[a(n - 1) - 1] 


(14-30) 


which, if the null hypothesis is true, is distributed as F a ~ i^m-o-i- Thus, we reject 
H 0 : r, = 0 if F 0 > Fo^-^h-d-i. The P value approach could also be used. 

It is instructive to examine the display in Table 14-9 on the next page. In this table 
we have presented the analysis of covariance as an “adjusted” analysis of variance. In 
the source of variation column, the total variability is measured by S yy with an - 1 
degrees of freedom. The source of variation “regression” has the sum of squares (S xy ) 2 / 
S xx with one degree of freedom. If there were no concomitant variable, we would have 
S xy = S xx = E xy = E xx = 0. Then the sum of squares for error would be simply E yy and 
the sum of squares for treatments would be S yy — E yy = T yy . However, because of the 
presence of the concomitant variable, we must “adjust” S yy and E yy for the regression 
of y on x as shown in Table 14-9. The adjusted error sum of squares has a(n - 1) - 1 
degrees of freedom instead of a(n - 1) degrees of freedom because an additional param¬ 
eter (the slope /3) is fitted to the data. 

The computations are usually displayed in an analysis of covariance table such as 
Table 14-10 on the next page. This layout is employed because it conveniently sum¬ 
marizes all the required sums of squares and cross-products as well as the sums of squares 
for testing hypotheses about treatment effects. In addition to testing the hypothesis that 
there are no differences in the treatment effects, we frequently find it useful in interpreting 
the data to present the adjusted treatment means. These adjusted means are computed 
according to 


Adjusted y L = y L — fi{x L — xj i = 1, 2,... , a (14-31) 


where 0 = E xy /E xx . This adjusted treatment mean is the least squares estimator of fx + 
r h i = 1, 2,..., a, in the model (Equation 14-15). The standard error of any adjusted 
treatment mean is 




(14-32) 



alysis of Covariance as an “Adjusted” Analysis of Varianc 
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Adjusted Treatments 
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Finally, we recall that the regression coefficient j8 in the model (Equation 14-15) has 
been assumed to be nonzero. We may test the hypothesis H 0 :f3 = 0 by using the test 
statistic 


(E xy ) 2 IE xx 

MS e 


(14-33) 


which under the null hypothesis is distributed as F XMn -^ Thus, we reject H 0 :f3 - 0 

if F 0 > F aJ . 


EXAMPLE 14-4 . 

Consider the experiment described at the beginning of Section 14-3. Three different 
machines produce a monofilament fiber for a textile company. The process engineer is 
interested in determining if there is a difference in the breaking strength of the fiber 
produced by the three machines. However, the strength of a fiber is related to its diameter, 
with thicker fibers being generally stronger than thinner ones. A random sample of five 
fiber specimens is selected from each machine. The fiber strength (y) and the correspond¬ 
ing diameter (x) for each specimen are shown in Table 14-8. 

The scatter diagram of breaking strength versus the fiber diameter (Figure 14-3) 
shows a strong suggestion of a linear relationship between breaking strength and diam¬ 
eter, and it seems appropriate to remove the effect of diameter on strength by an analysis 
of covariance. Assuming that a linear relationship between breaking strength and di¬ 
ameter is appropriate, the model is 

y-j = » + ^ {'■ z !; 2, 3 .., s 

Using Equations 14-17 through 14-25, we may compute 
S„ 

s xx 
s xy 


Tyy 

T xx 

T X y 


" i i fa - £ = (36) 2 + (41) 2 + • • ■ + (32) 2 - ^ = 346.40 
= E E 4 ~ ^ = (20) 2 + (25) 2 + • • • + (15) 2 - = 261.73 

= E E Jc«y# - = (20)(36) + (25)(41) + • • • + (15X32) 

= - E yl-~ = \ t(207) 2 + ( 216 ) 2 + ( 180 ) 2 ] - = 140.40 

n an 5 (3)(5) 

1 3 r 2 1 13621 2 

= - E *l - — = - [(126) 2 + (130) 2 + (106) 2 ] - = 66.13 

n an 5 (3)(5) 

= - E x,y, ~ = 7 [(126)(207) + (130X216) + (106)(184)] 

n ‘ ' an 5 
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Eyy = Syy - Tyy = 346.40 - 140.40 = 206.00 

£** = - T„ = 261.73 - 66.13 = 195.60 

Exy = - T v = 282.60 - 96.00 = 186.60 

From Equation 14-29, we find 

SS' E = Syy - (S X yf/S XX 

= 346.40 - (282.60) 2 /261.73 
= 41.27 

with an - 2 = (3)(5) -2=13 degrees of freedom; and from Equation 14-27, we find 

SS E = E yy - (E xy ) 2 IE xx 

= 206.00 - (186.60) 2 /195.60 
= 27.99 

with a(n - 1) - 1 = 3(5 - 1) - 1 = 11 degrees of freedom. 

The sum of squares for testing H 0 : r x = t 2 = r 3 = 0 is 

SSe ~ SS E = 41.27 - 27.99 
= 13.28 

with a-l=3-l = 2 degrees of freedom. These calculations are summarized in 
Table 14-11 on the next page. 

To test the hypothesis that machines differ in the breaking strength of fiber produced, 
that is, H 0 : r, = 0, we compute the test statistic from Equation 14-30 as 


Fo = 


(SS^ - SS E )/(a - 1) 
SS E /[a(n - 1) - 1] 
13.28/2 6.64 


27.99/11 2.54 


= 2.61 


Comparing this to Fo. 10,2,11 = 2.86, we find that the null hypothesis cannot be rejected. 
The P value of this test statistic is P = 0.1181. Thus there is no strong evidence that the 
fibers produced by the three machines differ in breaking strength. 

The estimate of the regression coefficient is computed from Equation 14-26 as 

„ Eyy 186.60 

B = — =-= 0.9540 

P Eyy 195.60 

We may test the hypothesis // 0 :/3 = 0 by using Equation 14-33. The test statistic is 


Fo = 


(Eyyf/Eyy _ (186.60) 2 /!95 ■ 60 


MS e 


2.54 


and because F 0 .oi,i,n = 9.65, we reject the hypothesis that B = 0. Therefore, there is a 
linear relationship between breaking strength and diameter, and the adjustment provided 
by the analysis of covariance was necessary. 
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The adjusted treatment means may be computed from Equation 14-31. These ad¬ 
justed means are 

Adjusted y,. = y u - P(x L - xj 

= 41.40 - (0.9540X25.20 - 24.13) = 40.38 
Adjusted y 2 . = y 2 . ~ P(x 2 . - xj 

= 43.20 - (0.9540X26.00 - 24.13) = 41.42 

and 

Adjusted y 3 . = y 3 . - P(x 3 . — xj 

= 36.00 - (0.9540)(21.20 - 24.13) = 38.80 


Comparing the adjusted treatment means with the unadjusted treatment means (the yj, 
we note that the adjusted means are much closer together, another indication that the 
covariance analysis was necessary. 

A basic assumption in the analysis of covariance is that the treatments do not influ¬ 
ence the covariate x because the technique removes the effect of variations in the x u . 
However, if the variability in the x im is due in part to the treatments, then analysis of 
covariance removes part of the treatment effect. Thus, we must be reasonably sure that 
the treatments do not affect the values x , y . In some experiments this may be obvious from 
the nature of the covariate, whereas in others it may be more doubtful. In our example, 
there may be a difference in fiber diameter (xj between the three machines. In such 
cases, Cochran and Cox (1957) suggest that an analysis of variance on the Xy values may 
be helpful in determining the validity of this assumption. For our problem, this procedure 
yields 


66.13/2 

195.60/12 


33.07 _ 
16.30 = 


which is less than F 0 . 10 , 2,12 = 2.81, so there is no reason to believe that machines produce 
fibers of different diameters. 


Diagnostic checking of the covariance model is based on residual analysis. For the 
covariance model, the residuals are 

ey = yy ~ Sy 

where the fitted values are 

y„ = A + t,- + p(.xu - xj = y„ + [y,. - y.. - P(x L - O 

+ PiXij - xj = y,. + P(x,j - xj 

Thus, 

e u = y 0 - y,. - P(x,j - xj (14-34) 

To illustrate the use of Equation 14-34, the residual for the first observation from 
the first machine in Example 14-4 is 

e„ = yi* - yi. - P(x u - xj = 36 - 41.4 - (0.9540)(20 - 25.2) 

= 36 - 36.4392 = -0.4392 
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A complete listing of observations, fitted values, and residuals is given in the following 
table: 


Observed Value y 0 

Fitted Value y t] 

Residual e u = y u — y tj 

36 

36.4392 

-0.4392 

41 

41.2092 

-0.2092 

39 

40.2552 

-1.2552 

42 

41.2092 

0.7908 

49 

47.8871 

1.1129 

40 

39.3840 

0.6160 

48 

45.1079 

2.8921 

39 

39.3840 

-0.3840 

45 

47.0159 

-2.0159 

44 

45.1079 

-1.1079 

35 

35.8092 

-0.8092 

37 

37.7171 

-0.7171 

42 

40.5791 

1.4209 

34 

35.8092 

-1.8092 

32 

30.0852 

1.9148 


The residuals are plotted versus the fitted values Sij in Figure 14-4, versus the covariate 
Xij in Figure 14-5 (page 614) and versus the machines in Figure 14-6 (page 614). A 
normal probability plot of the residuals is shown in Figure 14-7 (page 615). These plots 
do not reveal any major departures from the assumptions, so we conclude that the co- 
variance model (Equation 14-15) is appropriate for the breaking strength data. 

It is interesting to note what would have happened in this experiment if an analysis 
of covariance had not been performed, that is, if the breaking strength data (y) had been 
analyzed as a single-factor experiment in which the covariate x was ignored. The analysis 
of variance of the breaking strength data is shown in Table 14-12 (page 615). We would 
conclude, based on this analysis, that machines differ significantly in the strength of fiber 



25 30 35 40 45 50 

Figure 14-4 Plot of residuals versus fitted values for Example 
14-4. 
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Figure 14-5 Plot of residuals versus fiber diameter x for Example 
14-4. 


produced. This is exactly opposite the conclusion reached by the covariance analysis. If 
we suspected that the machines differed significantly in their effect on fiber strength, 
then we would try to equalize the strength output of the three machines. However, in 
this problem the machines do not differ in the strength of fiber produced after the linear 
effect of fiber diameter is removed. It would be helpful to reduce the within-machine 
fiber diameter variability because this would probably reduce the strength variability in 
the fiber. 


14-3.2 Computer Solution 

There are several computer software packages available that can perform the analysis of 
covariance. The output from the Minitab General Linear Models procedure for the data 



Figure 14-6 Plot of residuals versus machine. 
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Residuals, e tj 

Figure 14-7 Normal probability plot of residuals from Example 14-4. 


in Example 14-4 is shown in Table 14-13 (on the following page). This output is very 
similar to those presented previously. In the section of the output entitled “Analysis of 
Variance,” the “Seq SS” correspond to a “sequential” partitioning of the overall model 
sum of squares, say 

SS (Model) = SS (Diameter) + SS (Machine | Diameter) 

= 305.13 + 13.28 
= 318.41 

whereas the “Adj SS” corresponds to the “extra” sum of squares for each factor, 
that is, 

SS (Machine | Diameter) = 13.28 
and 

SS (Diameter | Machine) = 178.01 

Note that SS (Machine | Diameter) is the correct sum of squares to use for testing for no 
machine effect, and SS (Diameter | Machine) is the correct sum of squares to use for 
testing the hypothesis that = 0. The test statistics in Table 14-13 differ slightly from 
those computed manually because of rounding. 

Table 14-12 Incorrect Analysis of the Breaking Strength Data as a Single-Factor Experiment 
Source of Degrees of 

Variation Sum of Squares Freedom Mean Square F 0 P Value 

Machines 140.40 2 70.20 4.09 0.0442 

Error 206.00 12 17.17 

Total 346.40 14 
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Table 14-13 Minitab Output (Analysis of Covariance) for Example 14-4 _ 

General Linear Model 

Factor Type Levels Values 

Machine fixed 3123 

Analysis of Variance for Strength, using Adjusted SS for Tests 

Source DF Seq SS Adj SS Adj MS 

Diameter 1 305.13 178.01 178.01 

Machine 2 13.28 13.28 6.64 

Error 11 27.99 27.99 2.54 

Total 14 346.40 

Term Coef StDev T 

Constant 17.177 2.783 6.17 

Diameter 0.9540 0.1140 8.36 

Machine 

1 0.1824 0.5950 0.31 

2 1.2192 0.6201 1.97 

Means for Covariates 

Covariate Mean StDev 

Diameter 24.13 4.324 

Least Squares Means for Strength 

Machine 
1 
2 

3 


The program also computes the adjusted treatment means from Equation 14-31 
(Minitab refers to those as least squares means on the sample output) and the standard 
errors. The program will also compare all pairs of treatment means using the pairwise 
multiple comparison procedures discussed in Chapter 3. 


14-3.3 Development by the General Regression Significance Test 

It is possible to develop formally the procedure for testing H 0 : t, = 0 in the covariance 
model 

y„-v + r, + - ij + €, {': 

using the general regression significance test. Consider estimating the parameters in the 
model (Equation 14-15) by least squares. The least squares function is 

i = ii by - ii- t - p(x,j - xjf 


Mean StDev 
40.38 0.7236 
41.42 0.7444 
38.80 0.7879 


P 

0.000 

0.000 

0.765 

0.075 


F P 

69.97 0.000 

2.61 0.118 


(14-36) 
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and from dL/dji = dL/dr, = dL/dfi = 0, we obtain the normal equations 

/jl: anfi. + n 2 T = y.. (14-37a) 

t n/2 + nf, + j8 X (% — 3c„) = y,. i = 1,2a (14-37b) 

/3: X f, X (x,j - xj + &S„ = S V (14-37c) 

Adding the a equations in Equation 14-37b, we obtain Equation 14-37a because 
2f =] 2" =1 (Xij - xj = 0, so there is one linear dependency in the normal equations. 
Therefore, it is necessary to augment Equations 14-37 with a linearly independent equa¬ 
tion to obtain a solution. A logical side condition is 2f =1 f;• = 0. 

Using this condition, we obtain from Equation 14-37a 

(L = y_ , (14-38a) 

and from Equation 14-37b 

*t = % ~ y.. ~ fax, ~ xj (14-3 8b) 

Equation 14-37c may be rewritten as 

X (y,. - yJ X (xtj - xj - p X (*/. - xj X (% - x_) + j 3 S xx = 

'=i ;=i i=i y=i 

after substituting for f,. But we see that 

X (Pi. ~ PJ X (Xij ~ xj = T xy 

and 

X (Xi. ~ xj X (Xij ~ xj = T xx 

Therefore, the solution to Equation 14-37c is 



which was the result given previously in Section 14-3.1, Equation 14-26. 

We may express the reduction in the total sum of squares due to fitting the model 
(Equation 14-15) as 

R(/x, t, fi) = /ly.. + X Tji. + (3S xy 

= (y..)y.. + X [Pi. ~ y~ - (ejem. - x.M. + (E xy /E xx )S xy 

= yllan + X (y| - yjy, - (E xy /E xx ) X ft. ~ xjy,. + ( EJE xx )S xy 
= yl/an + T yy - (E xy /E xx )(T xy - S xy ) 

= y 2 Jan + T yy + (E xy ) z /E xx 
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This sum of squares has a + 1 degrees of freedom because the rank of the normal 
equations is a + 1. The error sum of squares for this model is 

SS E = E S yfj ~ R(V, r, p) 

= %%yfj- y 2 J an - T yy - (E xy flE xx 

= Syy- Tyy ~ (E xy ) 2 /E xx 

= Eyy ~ (Eyyf/E^ ( 1449 ) 

with an — (a + 1) = a(n — 1) — 1 degrees of freedom. This quantity was obtained 
previously as Equation 14-27. 

Now consider the model restricted to the null hypothesis, that is, to H 0 :t x = 
r 2 = • • • = T a = 0. This reduced model is 

y t = p + -xj +«, (m^» 

This is a simple linear regression model, and the least squares normal equations for this 
model are 


anfr = y„ (14-41a) 

PS^ = S xy (1441b) 

The solutions to these equations are fi = y_ and /3 = S xy /S xx , and the reduction in the 

total sum of squares due to fitting the reduced model is 

R(H, P) = fry.. + 

= (y.)y.. + (SJS^S^ 

= yl/an + (14-42) 

This sum of squares has two degrees of freedom. 

We may find the appropriate sum of squares for testing H 0 : r x = t 2 = ■ • • = T a = 0 
as 


fl(r| /X, fr) = R(fx, t, fr) - R(ii, p) 

= yt/an + T yy + (E^f/E^ - y 2 Jan - 
= Syy - iStyflS^ - [E yy - {E^/EJ, (1443) 

using Tyy = Syy - E yy . Note that R(t\ fi, j8) has a + 1 - 2 = a - 1 degrees of freedom 
and is identical to the sum of squares given by SSe - SS E in Section 14-3.1. Thus, the 
test statistic for H 0 : r, = 0 is 

R(t\ix, P)Ha - 1) (SSI, ~ SS E )/(a ~ 1) 

0 SS E /[ain - 1) - 1] SS F J[a(n - 1) - 1] 

which we gave previously as Equation 14-30. Therefore, by using the general regression 
significance test, we have justified the heuristic development of the analysis of covariance 
in Section 14-3.1. 
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14-3.4 Factorial Experiments with Co variates 

Analysis of covariance can be applied to more complex treatment structures, such as 
factorial designs. Provided enough data exists for every treatment combination, nearly 
any complex treatment structure can be analyzed through the analysis of covariance 
approach. We now show how the analysis of covariance could be used in the most 
common family of factorial designs used in industrial experimentation, the 2 k factorials. 

Imposing the assumption that the covariate affects the response variable identically 
across all treatment combinations, an analysis of covariance table similar to the procedure 
given in Section 14-3.1 could be performed. The only difference would be the treatment 
sum of squares. For a 2 2 factorial with n replicates, the treatment sum of squares (T yy ) 
would be (1 /«) 2- =1 E? =1 y 2 - y 2 /(2)(2)«. This quantity is the sum of the sums of 
squares for factors A, B, and the AB interaction. The adjusted'treatment sum of squares 
could then be partitioned into individual effect components, that is, adjusted main effects 
sum of squares SS A and SS B , and an adjusted interaction sum of squares, SS AB . 

The amount of replication is a key issue when broadening the design structure of 
the treatments. Consider a 2 3 factorial arrangement. A minimum of two replicates is 
needed to evaluate all treatment combinations with a separate covariate for each treatment 
combination (covariate by treatment interaction). This is equivalent to fitting a simple 
regression model to each treatment combination or design cell. With two observations 
per cell, one degree of freedom is used to estimate the intercept (the treatment effect), 
and the other is used to estimate the slope (the covariate effect). With this saturated 
model, no degrees of freedom are available to estimate the error. Thus, at least three 
replicates are needed for a complete analysis of covariance, assuming the most general 
case. This problem becomes more pronounced as the number of distinct design cells 
(treatment combinations) and covariates increases. 

If the amount of replication is limited, various assumptions can be made to allow 
some useful analysis. The simplest assumption (and typically the worst) that can be made 
is that the covariate has no effect. If the covariate is erroneously not considered, the entire 
analysis and subsequent conclusions could be dramatically in error. Another choice is to 
assume that there is no treatment by covariate interaction. Even if this assumption is 
incorrect, the average affect of the covariate across all treatments will still increase the 
precision of the estimation and testing of the treatment effects. One disadvantage of this 
assumption is that if several treatment levels interact with the covariate, the various terms 
may cancel one another out and the covariate term, if estimated alone with no interaction, 
may be insignificant. A third choice would be to assume some of the factors (such as 
some two-factor and higher interactions) are insignificant. This allows some degrees of 
freedom to be used to estimate error. This course of action, however, should be under¬ 
taken carefully and the subsequent models evaluated thoroughly because the estimation 
of error will be relatively imprecise unless enough degrees of freedom are allocated for 
it. With two replicates, each of these assumptions will free some degrees of freedom to 
estimate error and allow useful hypothesis tests to be performed. Which assumption to 
enforce should be dictated by the experimental situation and how much risk the experi¬ 
menter is willing to assume. We caution that in the effects model building strategy if a 
treatment factor is eliminated, then the resulting two “replicates” of each original 2 3 are 
not truly replicates. These “hidden replicates” do free degrees of freedom for parameter 
estimation but should not be used as replicates to estimate pure error because the exe¬ 
cution of the original design may not have been randomized that way. 
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Table 14-14 Response and Covariate Data for a 2 3 with 2 Replicates 


ABC 


y_ 


-l -l 

-l -l 

i -l 

l -l 

-l l 

-l l 

l l 

l l 

-l -l 

-l -l 

l -l 

l -l 

-l l 

-l l 

l l 

l l 


4.05 -30.73 

0.36 9.07 

5.03 39.72 

1.96 16.30 

5.38 -26.39 

8.63 54.58 

4.10 44.54 

11.44 66.20 

3.58 -26.46 

1.06 10.94 

15.53 103.01 

2.92 20.44 

2.48 -8.94 

13.64 73.72 

-0.67 15.89 

5.13 38.57 


To illustrate some of these ideas, consider the 2 3 factorial design with two replicates 
and a covariate shown in Table 14-14. If the response variable y is analyzed without 
accounting for the covariate, the following model results: 

y = 25.03 + 11.20A + 18.05B + 7.24C - 18.91AB + 14.80AC 

The overall model is significant at the a = 0.01 level with R 2 = 0.786 and MS E = 
470.82. The residual analysis indicates no problem with this model except the observation 
with y = 103.01 is unusual. 

If the second assumption, common slopes with no treatment by covariate interaction, 
is chosen, the full effects model and the covariate effect can be estimated. The Minitab 
output (from the General Linear Models routine) is shown in Table 14-15 on the facing 
page. Notice that the MS E has been reduced considerably by considering the covariate. 
The final resulting analysis after sequentially removing each nonsignificant interaction 
and the main effect C is shown in Table 14-16 on the facing page. This reduced model 
provides an even smaller MS E than does the full model with the covariate in Table 14-15. 

Finally, we could consider a third course of action, assuming certain interaction terms 
are negligible. We consider the full model that allows for different slopes between treat¬ 
ments and treatment by covariate interaction. We assume that the three-factor interactions 
(both ABC and ABCx ) are not significant and use their associated degrees of freedom to 
estimate error in the most general effects model that can be fit. This is often a practical 
assumption. Three-factor and higher interactions are usually negligible in most experi¬ 
mental settings. The current version of Minitab cannot model covariates that interact with 
treatments, so we use SAS PROC GLM. The type III sums of squares are the adjusted 
sums of squares that we require. Table 14-17 (page 622) presents the SAS results for 
this model. 

With a near-saturated model, the estimate of error will be fairly imprecise. Even with 
only a few terms being individually significant at the a = 0.05 level, the overall sense 
is that this model is better than the two previous scenarios (based on R 2 and the mean 
square for error). Because the treatment effects aspect of the model is of more interest, 
we sequentially remove terms from the covariate portion of the model to add degrees of 
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Table 14-15 Minitab Analysis of Covariance for the Experiment in Table 14-14, Assuming a 
Common Slope 


General Lin 

Lear Model 






Factor 

Type 

LeveIs Values 




A 

fixed 

2 -1 1 





B 

fixed 

2 -1 1 





C 

fixed 

2 -1 1 





Analysi 

is of Variance for y, using Adjusted SS for 

■ Tests 


Source 

DF 

Seq SS 

Adj SS 

Adj MS 

F 

P 

X 

1 

12155.9 

2521 . 6 

2521 .6 

28.10 

0.001 

A 

1 

1320.7 

1 403.8 

1403.8 

15.64 

0.005 

B 

1 

3997.6 

4066.2 

4066.2 

45.31 

0.000 

C 

1 

52.7 

82.3 

' 82.3 

0.92 

0.370 

A*B 

1 

3788.3 

3641 .0 

3641 .0 

40.58 

0.000 

A*C 

1 

10.2 

1.1 

1 . 1 

0.01 

0.913 

B* C 

1 

5 . 2 

8 . 4 

8 . 4 

0.09 

0.769 

A*B*C 

1 

33.2 

33.2 

33.2 

0.37 

0.562 

Error 

7 

628.1 

628.1 

89.7 



Total 

15 

21992.0 





Term 

Coef StDev 

T 

P 



Constar 

it -1 . 

016 5.454 

-0.19 

0.858 



X 

4.9245 0.9290 

5.30 

0.001 




freedom to the estimate of error. If we sequentially remove the ACx term followed by 
BCx, the MS e decreases to 0.7336 and several terms are insignificant. The final model is 
shown in Table 14-18 (on page 623) after sequentially removing Cx, AC, and BC. 

This example emphasizes the need to have degrees of freedom available to estimate 
experimental error to increase the precision of the hypothesis tests associated with the 
individual terms in the model. This process should be done sequentially to avoid elimi¬ 
nating significant terms masked by a poor estimate of error. 


Table 14-16 Minitab Analysis of Covariance, Reduced Model for the Experiment in Table 14-14 
General Linear Model 


Factor 

Type 

Levels Values 





A 

fixed 

2 -1 1 





B 

fixed 

2 -1 1 





Analysis 

of Va r 

iance for y, u 

sing Adjusted SS f< 

)r Tests 


Source 

DF 

Seq S S 

Adj SS 

Adj MS 

F 

P 

x 

1 

12155.9 

8287.9 

8287.9 

119.43 

0.000 

A 

1 

1320.7 

1404.7 

1404.7 

20.24 

0.001 

B 

1 

3997.6 

4097.7 

4097.7 

59.05 

0.000 

A*B 

1 

3754.5 

3754.5 

3754.5 

54.10 

0.000 

Error 

11 

763.3 

763.3 

69.4 



Total 

1 5 

21992.0 





Term 

Coef StDev 

T 

P 



Constant 

-1 . 

878 3.225 

-0.58 

0.572 



x 

5.0876 0.4655 

10.93 

0.000 
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Experiment in Table 14-14, Reduced Model 
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Reviewing the results obtained from the three approaches, we note that each method 
successively improves the model fit in this example. If there is a strong reason to believe 
that the covariate does not interact with the factors, it may be best to make that assumption 
at the outset of the analysis. This choice may also be dictated by software. Although 
experimental design software packages may only be able to model covariates that do not 
interact with treatments, the analyst may have a reasonable chance of identifying the 
major factors influencing the process, even if there is some covariate by treatment inter¬ 
action. We also note that all the usual tests of model adequacy are still appropriate and 
are strongly recommended as part of the ANCOVA model building process. 


14-4 REPEATED MEASURES 

In experimental work in the social and behavioral sciences and some aspects of engi¬ 
neering and the physical sciences, the experimental units are frequently people. Because 
of differences in experience, training, or background, the differences in the responses of 
different people to the same treatment may be very large in some experimental situations. 
Unless it is controlled, this variability between people would become part of the exper¬ 
imental error, and in some cases, it would significantly inflate the error mean square, 
making it more difficult to detect real differences between treatments. 

It is possible to control this variability between people by using a design in which 
each of the a treatments is used on each person (or “subject”)- Such a design is called 
a repeated measures design. In this section we give a brief introduction to repeated 
measures experiments with a single factor. 

Suppose that an experiment involves a treatments and every treatment is to be used 
exactly once on each of n subjects. The data would appear as in Table 14-19. Note that 
the observation represents the response of subject j to treatment i and that only n 
subjects are used. The model that we use for this design is 

y„ = n + t, , + ft + e,j (14-45) 

where 77 is the effect of the ith treatment and jS, is a parameter associated with the y'th 
subject. We assume that treatments are fixed (so 2f =1 r, = 0) and that the subjects em¬ 
ployed are a random sample of subjects from some larger population of potential subjects. 
Thus, the subjects collectively represent a random effect, so we assume that the mean of 
Pj is zero and that the variance of pj is <r% Because the term Pj is common to all a 
measurements on the same subject, the covariance between y /; and y n is not, in general, 


Table 14-19 

Data for a Single-Factor Repeated Measures Design 




Subject 


Treatment 

Treatment 

1 2 

n 

Totals 

1 

yn y 12 

yi n 

yi. 

2 

YZI Y22 

yin 

y% 

a 

y a \ v«2 

y„n 

y a . 

Subject Totals 

y. i y.2 

y.n 

y.. 



14-4 REPEATED MEASURES 


625 


zero. It is customary to assume that the covariance between y tj and y Vj is constant across 
all treatments and subjects. 

Consider an analysis of variance partitioning of the total sum of squares, say 

E E (y,j ~ yJ 2 = (y.j - y.f + E E - y.f (14-46) 

<•“1 j= i j= i <=i y=i 

We may view the first term on the right-hand side of Equation 14-46 as a sum of squares 
that results from differences between subjects and the second term as a sum of squares 
of differences within subjects. That is, 

SSt = SiSgetween Subjects 4" SSwithin Subjects 

The sums of squares 55 Be tween subjects and SSwithin subjects are statistically independent, with 
degrees of freedom 

an — 1 = (ji - 1) + n(a — 1) 

The differences within subjects depend on both differences in treatment effects and 
uncontrolled variability (noise or error). Therefore, we may decompose the sum of 
squares resulting from differences within subjects as follows: 

E E (y,j ~ yjf = «E Gt - y.f + E E Ov, - % ~ y.j + x.) 2 (14-47) 


The first term on the right-hand side of Equation 14-47 measures the contribution of the 
difference between treatment means to 5S W ithm subjects, and the second term is the residual 
variation due to error. Both components of 55 Within subjects are independent. Thus, 

SSwithin Subjects = ^Treatments + SS E 

with degrees of freedom given by 

n(a - 1) = (a - 1) + (a - 1 )(n - 1) 

respectively. 

To test the hypothesis of no treatment effect, that is, 

Ho-Ti = T 2 = • • • = T a = 0 

Hi : At least one r, # 0 

we would use the ratio 


yy I re:itment/(^ 1) l^^Treatmen 

SS E /(a - l)(n - 1) “ MS e 


(14-48) 


If the model errors are normally distributed, then under the null hypothesis, = 0, 
the statistic F 0 follows an F a _ 1(a _ 1)( „^ 1) distribution. The null hypothesis would be re¬ 
jected if F 0 > F a , a - 

The analysis of variance procedure is summarized in Table 14-20 (page 626), which 
also gives convenient computing formulas for the sums of squares. Readers should rec¬ 
ognize the analysis of variance for a single-factor design with repeated measures as 
equivalent to the analysis for a randomized complete block design, with subjects consid¬ 
ered to be the blocks. 





626 



14-5 PROBLEMS 627 


14-5 PROBLEMS - 

14-1. Reconsider the experiment in Problem 5-22. Use the Box-Cox procedure to determine 
if a transformation on the response is appropriate (or useful) in the analysis of the data 
from this experiment. 

14-2. In Example 6-3 we selected a log transformation for the drill advance rate response. Use 
the Box-Cox procedure to demonstrate that this is an appropriate data transformation. 

14-3. Reconsider the smelting process experiment in Problem 8-23, where a 2 6-3 fractional 
factorial design was used to study the weight of packing material that is stuck to carbon 
anodes after baking. Each of the eight runs in the design was replicated three times, and 
both the average weight and the range of the weights at each test combination were 
treated as response variables. Is there any indication that a transformation is required for 
either response? „ 

14-4. In Problem 8-24 a replicated fractional factorial design was used to study substrate 
camber in semiconductor manufacturing. Both the mean and standard deviation of the 
camber measurements were used as response variables. Is there any indication that a 
transformation is required for either response? 

14-5. Reconsider the photoresist experiment in Problem 8-25. Use the variance of the resist 
thickness at each test combination as the response variable. Is there any indication that 
a transformation is required? 

14-6. In the grill defects experiment described in Problem 8-29, a variation of the square root 
transformation was employed in the analysis of the data. Use the Box-Cox method to 
determine if this is the appropriate transformation. 

14-7. In the central composite design of Problem 11-14, two responses were obtained, the 
mean and variance of an oxide thickness. Use the Box-Cox method to investigate the 
potential usefulness of transformation for both of these responses. Is the log transfor¬ 
mation suggested in part (c) of that problem appropriate? 

14-8. In the 3 3 factorial design of Problem 11-33 one of the responses is a standard deviation. 
Use the Box-Cox method to investigate the usefulness of transformations for this re¬ 
sponse. Would your answer change if we used the variance as the response? 

14-9. Problem 11-34 suggests using In ( s 2 ) as the response [refer to part (b)]. Does the Box- 
Cox method indicate that a transformation is appropriate? 

14-10. A soft drink distributor is studying the effectiveness of delivery methods. Three different 
types of hand trucks have been developed, and an experiment is performed in the com¬ 
pany’s methods engineering laboratory. The variable of interest is the delivery time in 
minutes (y); however, delivery time is also strongly related to the case volume delivered 
(x). Each hand truck is used four times and the data that follow are obtained. Analyze 
these data and draw appropriate conclusions. Use a = 0.05. 


Hand Truck Type 



2 


3 


y 

X 

y 

X 

? 

X 

27 

24 

25 

26 

40 

38 

44 

40 

35 

32 

22 

26 

33 

35 

46 

42 

53 

50 

41 

40 

26 

25 

18 

20 


14-11. Compute the adjusted treatment means and the standard errors of the adjusted treatment 
means for the data in Problem 14-10. 

14-12. The sums of squares and products for a single-factor analysis of covariance follow. 
Complete the analysis and draw appropriate conclusions. Use a = 0.05. 


628 


CHAPTER 14 OTHER DESIGN AND ANALYSIS TOPICS 


Source of 
Variation 

Degrees of 
Freedom 

Sums of Squares and Products 

x xy y 

Treatment 

3 

1500 

1000 

650 

Error 

12 

6000 

1200 

550 

Total 

15 

7500 

2200 

1200 


14-13. Find the standard errors of the adjusted treatment means in Example 14-4. 

14-14. Four different formulations of an industrial glue are being tested. The tensile strength of 
the glue when it is applied to join parts is also related to the application thickness. Five 
observations on strength (y) in pounds and thickness (x) in 0.01 inches are obtained for 
each formulation. The data are shown in the following table. Analyze these data and 
draw appropriate conclusions. 


_y _ x_ 

46.5 13 

45.9 14 

49.8 12 

46.1 12 

44.3 14 


Glue Formulation 


2 3 


_y _ x _ y_ 

48.7 12 46.3 

49.0 10 47.1 

50.1 11 48.9 

48.5 12 48.2 

45.2 14 50.3 


15 

14 

11 

11 

10 


y 

44.7 

43.0 

51.0 

48.1 

48.6 


16 

15 

10 

12 

11 


14-15. Compute the adjusted treatment means and their standard errors using the data in Problem 
14-14. 

14-16. An engineer is studying the effect of cutting speed on the rate of metal removal in a 
machining operation. However, the rate of metal removal is also related to the hardness 
of the test specimen. Five observations are taken at each cutting speed. The amount of 
metal removed (y) and the hardness of the specimen (x) are shown in the following table. 
Analyze the data using an analysis of covariance. Use a = 0.05. 


Cutting Speed (rpm) 


1000 

1200 


1400 


y 

X 

y 

X 

y 

X 

68 

120 

112 

165 

118 

175 

90 

140 

94 

140 

82 

132 

98 

150 

65 

120 

73 

124 

77 

125 

74 

125 

92 

141 

88 

136 

85 

133 

80 

130 


14-17. Show that in a single-factor analysis of covariance with a single covariate a 100(1 - a) 
percent confidence interval on the ith adjusted treatment mean is 


% - M - 5J ± t a 



14-5 PROBLEMS 629 


Using this formula, calculate a 95 percent confidence interval on the adjusted mean of 
machine 1 in Example 14-4. 

14-18. Show that in a single-factor analysis of covariance with a single covariate, the standard 
error of the difference between any two adjusted treatment means is 

^Adj3>,.-AdjJ ; . — |^4fiS £ 4 — ^ 

14-19. Discuss how the operating characteristic curves for the analysis of variance can be used 
in the analysis of covariance. 
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I. Cumulative Standard Normal Distribution 

II. Percentage Points of the t Distribution 

III. Percentage Points of the \ 2 Distribution 

IV. Percentage Points of the F Distribution 

V. Operating Characteristic Curves for the Fixed Effects Model Analysis of 
Variance 

VI. Operating Characteristic Curves for the Random Effects Model Analysis of 
Variance 

VII. Significant Ranges for Duncan’s Multiple Range Test 

VIII. Percentage Points of the Studentized Range Statistic 

IX. Critical Values for Dunnett’s Test for Comparing Treatments with a Control 

X. Coefficients of Orthogonal Polynomials 

XI. Random Numbers 
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.76730 

.79673 

.82381 

.84849 

.87076 

.89065 

.90824 

.92364 

.93699 

.94845 

.95818 

.96637 

.97320 

.97882 

.98341 

.98713 

.99010 

.99245 

.99430 

.99573 

.99683 

.99767 

.99831 

.99878 

.99913 

.99938 

.99957 

.99970 


.04 

.51595 

.55567 

.59483 

.63307 

.67003 

.70540 

.73891 

.77035 

.79954 

.82639 

.85083 

.87285 

.89251 

.90988 

.92506 

.93822 

.94950 

.95907 

.96711 

.97381 

.97932 

.93882 

.98745 

.99036 

.99266 

.99446 

.99585 

.99693 

.99774 

.99836 

.99882 

.99916 

.99940 

.99958 

.99971 


3.5 .99977 .99978 

3.6 .99984 .99985 

3.7 .99989 .99990 

3.8 .99993 .99993 

3.9 .99995 .99995 


.99978 .99979 .99980 
.99985 .99986 .99986 
.99990 .99990 .99991 
99993 .99994 .99994 
.99996 .99996 .99996 


«Reproduced with permission from Probability and Statistics in Engineering and Management Science , 3rd 
edition, by W. W. Hines and D. C. Montgomery, Wiley, New York, 1990. 
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I. Cumulative Standard Normal Distribution (continued) 


.51994 

.55962 

.59871 
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II. Percentage Points of the t Distribution 11 



1 .325 1.000 3.078 6.314 12.706 31.821 63.657 127.32 318.31 636.62 

2 .289 .816 1.886 2.920 4.303 6.965 9.925 14.089 23.326 31.598 

3 .277 .765 1.638 2.353 3.182 4.541 5.841 7.453 10.213 12.924 

4 .271 .741 1.533 2.132 2.776 3.747 4.604 5.598 7.173 8.610 

5 .267 .727 1.476 2.015 2.571 3.365 4.032 4.773 5.893 6.869 

6 .265 .727 1.440 1.943 2.447 3.143 3.707 4.317 5.208 5.959 

7 .263 .711 1.415 1.895 2.365 2.998 3.499 4.019 4.785 5.408 

8 .262 .706 1.397 1.860 2.306 2.896 3.355 3.833 4.501 5.041 

9 .261 .703 1.383 1.833 2.262 2.821 3.250 3.690 4.297 4.781 

10 .260 .700 1.372 1.812 2.228 2.764 3.169 3.581 4.144 4.587 

11 .260 .697 1.363 1.796 2.201 2.718 3.106 3.497 4.025 4.437 

12 .259 .695 1.356 1.782 2.179 2.681 3.055 3.428 3.930 4.318 

13 .259 .694 1.350 1.771 2.160 2.650 3.012 3.372 3.852 4.221 

14 .258 .692 1.345 1.761 2.145 2.624 2.977 3.326 3.787 4.140 

15 .258 .691 1.341 1.753 2.131 2.602 2.947 3.286 3.733 4.073 

16 .258 .690 1.337 1.746 2.120 2.583 2.921 3.252 3.686 4.015 

17 .257 .689 1.333 1.740 2.110 2.567 2.898 3.222 3.646 3.965 

18 .257 .688 1.330 1.734 2.101 2.552 2.878 3.197 3.610 3.922 

19 .257 .688 1.328 1.729 2.093 2.539 2.861 3.174 3.579 3.883 

20 .257 .687 1.325 1.725 2.086 2.528 2.845 3.153 3.552 3.850 

21 .257 .686 1.323 1.721 2.080 2.518 2.831 3.135 3.527 3.819 

22 .256 .686 1.321 1.717 2.074 2.508 2.819 3.119 3.505 3.792 

23 .256 .685 1.319 1.714 2.069 2.500 2.807 3.104 3.485 3.767 

24 .256 .685 1.318 1.711 2.064 2.492 2.797 3.091 3.467 3.745 

25 .256 .684 1.316 1.708 2.060 2.485 2.787 3.078 3.450 3.725 

26 .256 .684 1.315 1.706 2.056 2.479 2.779 3.067 3.435 3.707 

27 .256 .684 1.314 1.703 2.052 2.473 2.771 3.057 3.421 3.690 

28 .256 .683 1.313 1.701 2.048 2.467 2.763 3.047 3.408 3.674 

29 .256 .683 1.311 1.699 2.045 2.462 2.756 3.038 3.396 3.659 

30 .256 .683 1.310 1.697 2.042 2.457 2.750 3.030 3.385 3.646 

40 .255 .681 1.303 1.684 2.021 2.423 2.704 2.971 3.307 3.551 

60 .254 .679 1.296 1.671 2.000 2.390 2.660 2.915 3.232 3.460 

120 .254 .677 1.289 1.658 1.980 2.358 2.617 2.860 3.160 3.373 

oo .253 .674 1.282 1.645 1.960 2.326 2.576 2.807 3.090 3.291 


v = degrees of freedom. 

a Adapted with permission from Biometrika Tables for Statisticians, Vol. 1, 3rd edition, by E. S. Pearson and 
H. O. Hartley, Cambridge University Press, Cambridge, 1966. 
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III. Percentage Points of the \ l Distribution 3 


.995 .990 .975 .950 .500 .050 .025 .010 .005 


1 0.00 + 

2 0.01 

3 0.07 

4 0.21 

5 0.41 

6 0.68 

7 0.99 

8 1.34 

9 1.73 

10 2.16 

11 2.60 

12 3.07 

13 3.57 

14 4.07 

15 4.60 

16 5.14 

17 5.70 

18 6.26 

19 6.84 

20 7.43 


0.00 + 0.00 + 

0.02 0.05 

0.11 0.22 

0.30 0.48 

0.55 0.83 

0.87 1.24 

1.24 1.69 

1.65 2.18 

2.09 2.70 

2.56 3.25 

3.05 3.82 

3.57 4.40 

4.11 5.01 

4.66 5.63 

5.23 6.27 

5.81 6.91 

6.41 7.56 

7.01 8.23 

7.63 8.91 

8.26 9.59 


0.00 + 0.45 

0.10 1.39 

0.35 2.37 

0.71 3.36 

1.15 4.35 

1.64 5.35 

2.17 6.35 

2.73 7.34 

3.33 8.34 

3.94 9.34 

4.57 10.34 

5.23 11.34 

5.89 12.34 

6.57 13.34 

7.26 14.34 

7.96 15.34 

8.67 16.34 

9.39 17.34 

10.12 18.34 

10.85 19.34 


3.84 5.02 

5.99 7.38 

7.81 9.35 

9.49 11.14 

11.07 12.38 

12.59 14.45 

14.07 16.01 

15.51 17.53 

16.92 19.02 

18.31 20.48 

19.68 21.92 

21.03 23.34 

22.36 24.74 

23.68 26.12 

25.00 27.49 

26.30 28.85 

27.59 30.19 

28.87 31.53 

30.14 32.85 

31.41 34.17 


6.63 7.88 

9.21 10.60 

11.34 12.84 

13.28 14.86 

15.09 16.75 

16.81 18.55 

18.48 20.28 

20.09 21.96 

21.67 23.59 

23.21 25.19 

24.72 26.76 

26.22 28.30 

27.69 29.82 

29.14 31.32 

30.58 32.80 

32.00 34.27 

33.41 35.72 

34.81 37.16 

36.19 38.58 

37.57 40.00 


25 10.52 

30 13.79 

40 20.71 

50 27.99 

60 35.53 


11.52 

14.95 

22.16 

29.71 

37.48 


13.12 

16.79 

24.43 

32.36 

40.48 


14.61 

18.49 

26.51 

34.76 

43.19 


24.34 

29.34 

39.34 

49.33 

59.33 


37.65 

43.77 

55.76 

67.50 

79.08 


40.65 

46.98 

59.34 

71.42 

83.30 


44.31 

50.89 

63.69 

76.15 

88.38 


46.93 

53.67 

66.77 

79.49 

91.95 


70 43.28 45.44 48.76 

80 51.17 53.54 57.15 

90 59.20 61.75 65.65 

100 67.33 70.06 74.22 


51.74 

60.39 

69.13 

77.93 


69.33 

79.33 

89.33 

99.33 


90.53 

101.88 

113.14 

124.34 


95.02 

106.63 

118.14 

129.56 


100.42 

112.33 

124.12 

135.81 


104.22 

116.32 

128.30 

140.17 


v = degrees of freedom 

a Adapted with permission from Biometrika Tables for Statisticians, Vol. 1, 3rd edition by E. S. Pearson and 
H. O. Hartley, Cambridge University Press, Cambridge, 1966. 
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VI. Operating Characteristic Curves for the Random Effects Model Analysis 
of Variance ( continued) 



X (for a = .01) —► 1 234567 89 10 













; = .01) 
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VII. Significant Ranges for Duncan’s Multiple Range Test 11 
_ T o.oi (P> f) _ 








P 







/ 

2 

3 

4 

5 

6 

7 

8 

9 

10 

20 

50 

100 

1 

90.0 

90.0 

90 0 

90.0 

90.0 

90.0 

90.0 

90.0 

90.0 

90.0 

90.0 

90.0 

2 

14.0 

14.0 

14.0 

14.0 

14.0 

14.0 

14.0 

14.0 

14.0 

14.0 

14.0 

14.0 

3 

8.26 

8.5 

8.6 

8.7 

8.8 

8.9 

8.9 

9.0 

9.0 

9.3 

9.3 

9.3 

4 

6.51 

6.8 

6.9 

7.0 

7.1 

7.1 

7.2 

7.2 

7.3 

7.5 

7.5 

7.5 

5 

5.70 

5.96 

6.11 

6.18 

6.26 

6.33 

6.40 

6.44 

6.5 

6.8 

6.8 

6.8 

6 

5.24 

5.51 

S6S 

5.73 

5.81 

5.88 

5.95 

6.00 

6.0 

6.3 

6.3 

6.3 

7 

4.95 

5.22 

5.37 

5.45 

5.53 

5.61 

5.69 

5.73 

5.8 

6.0 

6.0 

6.0 

8 

4.74 

5.00 

5.14 

5.23 

5.32 

5.40 

5.47 

5.51 

5.5 

5.8 

5.8 

5.8 

9 

4.60 

4.86 

4.99 

5.08 

5.17 

5.25 

5.32 

5.36 

5.4 

5.7 

5.7 

5.7 

10 

4.48 

4.73 

4.88 

4.96 

5.06 

5.13 

5.20 

5.24 

5.28 

5.55 

5.55 

5.55 

ii 

4.39 

4^3 

4.77 

4.86 

4.94 

5.01 

5.06 

5.12 

5.15 

5.39 

5.39 

5.39 

12 

4.32 

4.55 

4.68 

4.76 

4.84 

4.92 

4.96 

5.02 

5.07 

5.26 

5.26 

5.26 

13 

4.26 

4.48 

4.62 

4.69 

4.74 

4.84 

4.88 

4.94 

4.98 

5.15 

5.15 

5.15 

14 

4.21 

4.42 

4.55 

4.63 

4.70 

4.78 

4.83 

4.87 

4.91 

5.07 

5.07 

5.07 

15 

4.17 

4.37 

4.50 

4.58 

4.64 

4.72 

4.77 

4.81 

4.84 

5.00 

5.00 

5.00 

16 

4.13 

4.34 

4.45 

4.54 

4.60 

4.67 

4.72 

4.76 

4.79 

4.94 

4.94 

4.94 

17 

4.10 

4.30 

4.41 

4.50 

4.56 

4.63 

4.68 

4.73 

4.75 

4.89 

4.89 

4.89 

18 

4.07 

4.27 

4.38 

4.46 

4.53 

4.59 

4.64 

4.68 

4.71 

4.85 

4.85 

4.85 

19 

4.05 

4.24 

4.35 

4.43 

4.50 

4.56 

4.61 

4.64 

4.67 

4.82 

4.82 

4.82 

20 

4.02 

4.22 

4.33 

4.40 

4.47 

4.53 

4.58 

4.61 

4.65 

4.79 

4.79 

4.79 

30 

3.89 

4.06 

4.16 

4.22 

4.32 

4.36 

4.41 

4.45 

4.48 

4.65 

4.71 

4.71 

40 

3.82 

3.99 

4.10 

4.17 

4.24 

4.30 

4.34 

4.37 

4.41 

4.59 

4.69 

4.69 

60 

3.76 

3.92 

4.03 

4.12 

4.17 

4.23 

4.27 

4.31 

4.34 

4.53 

4.66 

4.66 

100 

3.71 

3.86 

3.98 

4.06 

4.11 

4.17 

4.21 

4.25 

4.29 

4.48 

4.64 

4.65 

* 

3.64 

3.80 

3.90 

3.98 

4.04 

4.09 

4.14 

4.17 

4.20 

4.41 

4.60 

4.68 

/ = degrees of freedom. 











a Reproduced 

with permission from 

“Multiple Rang 

e and Multiple F Tests,” by D. B. Duncan, Biometrics, Vol. 

1, No. 

1, pp. 

1-42, 1955 
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20 
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I 

18.0 
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18.0 

18.0 
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18.0 
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18.0 

18.0 

18.0 
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18.0 

2 

6.09 
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6.09 
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4.50 

4.50 

4 

3.93 

4.01 

4.02 

4.02 

4.02 

4.02 

4.02 

4.02 

4.02 

4.02 

4.02 

4.02 

5 

3.64 

3.74 

3.79 

3.83 

3.83 

3.83 

3.83 

3.83 

3.83 

3.83 

3.83 

3.83 

6 

3.46 

3.58 

3.64 

3.68 

3.68 

3.68 

3.68 

3.68 

3.68 

3.68 

3.68 

3.68 

7 

3.35 

3.47 

3.54 

3.58 

3.60 

3.61 

3.61 

3.61 

3.61 

3.61 

3.61 

3.61 

8 

3.26 

3.39 

3.47 

3.52 

3.55 

3.56 

3.56 

3.56 

3.56 

3.56 

3.56 

3.56 

9 

3.20 

3.34 

3.41 

3.47 

3.50 

3.52 

3.52 

3.52 

3.52 

3.52 

3.52 

3.52 

10 

3.15 

3.30 

3.37 

3.43 

3.46 

3.47 

3.47 

3.47 

3.47 

3.48 

3.48 

3.48 

11 

3.11 

3.27 

3.35 

3.39 

3 43 

3.44 

3.45 

3.46 

3.46 

3.48 

3.48 

3.48 

12 

3.08 

3.23 

3.33 

3.36 

3.40 

3.42 

3.44 

3.44 

3.46 

3.48 

3.48 

3.48 

13 

3.06 

3.21 

3.30 

3.35 

3.38 

3.41 

3.42 

3.44 

3.45 

3.47 

3.47 

3.47 

14 

3.03 

3.18 

3.27 

3.33 

3.37 

3.39 

3.41 

3.42 

3.44 

3.47 

3.47 

3.47 

15 

3.01 

3.16 

3.25 
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3.38 
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3.43 
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16 

3.00 

3.15 
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3.37 

3.39 

3.41 

3.43 

3.47 

3.47 

3.47 


2.98 

3.13 
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3.36 

3.38 
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3.42 

3.47 

3.47 

3.47 

18 

2.97 

3.12 

3.21 

3.27 

3.32 

3.35 

3.37 

3.39 

3.41 

3.47 

3.47 

3.47 

19 

2.96 

3.11 

3.19 

3.26 

3.31 

3.35 

3.37 

3.39 

3.41 

3.47 

3.47 

3.47 

20 

2.95 

3.10 

3.18 

3.25 

3.30 

3.34 

3.36 

3.38 

3.40 

3.47 

3.47 

3.47 

30 

7 8Q 

3.04 

3.12 

3.20 

3.25 

3.29 

3.32 

3.35 

3.37 

3.47 

3.47 

3.47 

40 

2.86 

3.01 

3.10 

3.17 

3.22 

3.27 

3.30 

3.33 

3.35 

3.47 

3.47 

3.47 

60 

2.83 

2.98 

3.08 

3.14 

3.20 

3.24 

3.28 

3.31 

3.33 

3.47 

3.48 

3.48 

100 

2.80 

2.95 

3.05 

3.12 

3.18 

3.22 

3.26 

3.29 

3.32 

3.47 

3.53 

3.53 

00 

2.77 

2.92 

3.02 

3.09 

3.15 

3.19 

3.23 

3.26 

3.29 

3.47 

3.61 

3.67 





















VIII. Percentage Points of the Studentized Range Statistic 3 
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centage Points of the Studentized Range Statistic ( continued) 

<Jo.05(i>> /) 
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IX. Critical Values for Dunnett’s Test for Comparing Treatments with a Control 3 
tio.os(« ~~ 1> f) 

Two-Sided Comparisons 


/ 


a 

— 1 = Number of Treatment Means (excluding control) 


1 

2 

3 

4 

5 

6 

7 

8 

9 

5 

2.57 

3.03 

3.29 

3.48 

3.62 

3.73 

3.82 

3.90 

3.97 

6 

2.45 

2.86 

3.10 

3.26 

3.39 

3.49 

3.57 

3.64 

3.71 

7 

2.36 

2.75 

2.97 

3.12 

3.24 

3.33 

3.41 

3.47 

3.53 

8 

2.31 

2.67 

2.88 

3.02 

3.13 

3.22 

3.29 

3.35 

3.41 

9 

2.26 

2.61 

2.81 

2.95 

3.05 

3.14 

3.20 

3.26 

3.32 

10 

2.23 

2.57 

2.76 

2.89 

2.99 

3.07 

3.14 

3.19 

3.24 

11 

2.20 

2.53 

2.72 

2.84 

2.94 

3.02 

3.08 

3.14 

3.19 

12 

2.18 

2.50 

2.68 

2.81 

2.90 

2.98 

3.04 

3.09 

3.14 

13 

2.16 

2.48 

2.65 

2.78 

2.87 

2.94 

3.00 

3.06 

3.10 

14 

2.14 

2.46 

2.63 

2.75 

2.84 

2.91 

2.97 

3.02 

3.07 

15 

2.13 

2.44 

2.61 

2.73 

2.82 

2.89 

2.95 

3.00 

3.04 

16 

2.12 

2.42 

2.59 

2.71 

2.80 

2.87 

2.92 

2.97 

3.02 

17 

2.11 

2.41 

2.58 

2.69 

2.78 

2.85 

2.90 

2.95 

3.00 

18 

2.10 

2.40 

2.56 

2.68 

2.76 

2.83 

2.89 

2.94 

2.98 

19 

2.09 

2.39 

2.55 

2.66 

2.75 

2.81 

2.87 

2.92 

2.96 

20 

2.09 

2.38 

2.54 

2.65 

2.73 

2.80 

2.86 

2.90 

2.95 

24 

2.06 

2.35 

2.51 

2.61 

2.70 

2.76 

2.81 

2.86 

2.90 

30 

2.04 

2.32 

2.47 

2.58 

2.66 

2.72 

2.77 

2.82 

2.86 

40 

2.02 

2.29 

2.44 

2.54 

2.62 

2.68 

2.73 

2.77 

2.81 

60 

2.00 

2.27 

2.41 

2.51 

2.58 

2.64 

2.69 

2.73 

2.77 

120 

1.98 

2.24 

2.38 

2.47 

2.55 

2.60 

2.65 

2.69 

2.73 

00 

1.96 

2.21 

2.35 

2.44 

2.51 

2.57 

2.61 

2.65 

2.69 


/ = degrees of freedom. 

“ Reproduced with permission from C. W. Dunnett, “New Tables for Multiple Comparison with a Control,” 
Biometrics, Vol. 20, No. 3, 1964, and from C. W. Dunnett, “A Multiple Comparison Procedure for Comparing 
Several Treatments with a Control,” Journal of the American Statistical Association, Vol. 50, 1955. 
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IX. Critical Values for Dunnett’s Test for Comparing Treatments with a Control 
dooM-hf) 

Two-Sided Comparisons (continued) 





a- 

= Number of Treatment Means (excluding control) 



f 

1 

2 

3 

4 

5 

6 

7 

8 

9 

5 

4.03 

4.63 

4.98 

5.22 

5.41 

5.56 

5.69 

5.80 

5.89 

6 

3.71 

4.21 

4.51 

4.71 

4.87 

5.00 

5.10 

5.20 

5.28 

7 

3.50 

3.95 

4.21 

4.39 

4.53 

4.64 

4.74 

4.82 

4.89 

$ 

3.36 

3.77 

4.00 

4.17 

4.29 

4.40 

4.48 

4.56 

4.62 

9 

3.25 

3.63 

3.85 

4.01 

4.12 

4.22 

4.30 

4.37 

4.43 

10 

3.17 

3.53 

3.74 

3.88 

3.99 

4.08 

4.16 

4.22 

4.28 

11 

3.11 

3.45 

3.65 

3.79 

3.89 

3.98 

4.05 

4.11 

4.16 

12 

3.05 

3.39 

3.58 

3.71 

3.81 

3.89 

3.96 

4.02 

4.07 

13 

3.01 

3.33 

3.52 

3.65 

3.74 

3.82 

3.89 

3.94 

3.99 

14 

2.98 

3.29 

3.47 

3.59 

3.69 

3.76 

3.83 

3.88 

3.93 

15 

2.95 

3.25 

3.43 

3.55 

3.64 

3.71 

3.78 

3.83 

3.88 

16 

2.92 

3.22 

3.39 

3.51 

3.60 

3.67 

3.73 

3.78 

3.83 

17 

2.90 

3.19 

3.36 

3.47 

3.56 

3.63 

3.69 

3.74 

3.79 

18 

2.88 

3.17 

3.33 

3.44 

3.53 

3.60 

3.66 

3.71 

3.75 

19 

2.86 

3.15 

3.31 

3.42 

3.50 

3.57 

3.63 

3.68 

3.72 

20 

2.85 

3.13 

3.29 

3.40 

3.48 

3.55 

3.60 

3.65 

3.69 

24 

2.80 

3.07 

3.22 

3.32 

3.40 

3.47 

3.52 

3.57 

3.61 

30 

2.75 

3.01 

3.15 

3.25 

3.33 

3.39 

3.44 

3.49 

3.52 

40 

2.70 

2.95 

3.09 

3.19 

3.26 

3.32 

3.37 

3.41 

3.44 

60 

2.66 

2.90 

3.03 

3.12 

3.19 

3.25 

3.29 

3.33 

3.37 

120 

2.62 

2.85 

2.97 

3.06 

3.12 

3.18 

3.22 

3.26 

3.29 


2.58 

2.79 

2.92 

3.00 

3.06 

3.11 

3.15 

3.19 

3.22 

do. 05 (a- 1,/) 

One-Sided Comparisons 




a- 

= Number of Treatment Means (excluding control) 



/ 

1 

2 

3 

4 

5 

6 

7 

8 

9 

5 

2.02 

2.44 

2.68 

2.85 

2.98 

3.08 

3.16 

3.24 

3.30 

6 

1.94 

2.34 

2.56 

2.71 

2.83 

2.92 

3.00 

3.07 

3.12 

7 

1.89 

2.27 

2.48 

2.62 

2.73 

2.82 

2.89 

2.95 

3.01 

8 

1.86 

2.22 

2.42 

2.55 

2.66 

2.74 

2.81 

2.87 

2.92 

9 

1.83 

2.18 

2.37 

2.50 

2.60 

2.68 

2.75 

2.81 

2.86 

10 

1.81 

2.15 

2.34 

2.47 

2.56 

2.64 

2.70 

2.76 

2.81 

11 

1.80 

2.13 

2.31 

2.44 

2.53 

2.60 

2.67 

2.72 

2.77 

12 

1.78 

2.11 

2.29 

2.41 

2.50 

2.58 

2.64 

2.69 

2.74 

13 

1.77 

2.09 

2.27 

2.39 

2.48 

2.55 

2.61 

2.66 

2.71 

14 

1.76 

2.08 

2.25 

2.37 

2.46 

2.53 

2.59 

2.64 

2.69 

15 

1.75 

2.07 

2.24 

2.36 

2.44 

2.51 

2.57 

2.62 

2.67 

16 

1.75 

2.06 

2.23 

2.34 

2.43 

2.50 

2.56 

2.61 

2.65 

17 

1.74 

2.05 

2.22 

2.33 

2.42 

2.49 

2.54 

2.59 

2.64 

18 

1.73 

2.04 

2.21 

2.32 

2.41 

2.48 

2.53 

2.58 

2.62 

19 

1.73 

2.03 

2.20 

2.31 

2.40 

2.47 

2.52 

2.57 

2.61 

20 

1.72 

2.03 

2.19 

2.30 

2.39 

2.46 

2.51 

2.56 

2.60 

24 

1.71 

2.01 

2.17 

2.28 

2.36 

2.43 

2.48 

2.53 

2.57 

30 

1.70 

1.99 

2.15 

2.25 

2.33 

2.40 

2.45 

2.50 

2.54 

40 

1.68 

1.97 

2.13 

2.23 

2.31 

2.37 

2.42 

2.47 

2.51 

60 

1.67 

1.95 

2.10 

2.21 

2.28 

2.35 

2.39 

2.44 

2.48 

120 

1.66 

1.93 

2.08 

2.18 

2.26 

2.32 

2.37 

2.41 

2.45 


1.64 

1.92 

2.06 

2.16 

2.23 

2.29 

2.34 

2.38 

2.42 
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IX. Critical Values for Dunnett’s Test for Comparing Treatments with a Control 

do.oi(a- 1,/) 

One-Sided Comparisons (continued) 


f 


a 

— 1 = Number of Treatment Means (excluding control) 


1 

2 

3 

4 

5 

6 

7 

8 

9 

5 

3.37 

3.90 

4.21 

4.43 

4.60 

4.73 

4.85 

4.94 

5.03 

6 

3.14 

3.61 

3.88 

4.07 

4.21 

4.33 

4.43 

4.51 

4.59 

7 

3.00 

3.42 

3.66 

3.83 

3.96 

4.07 

4.15 

4.23 

4.30 

8 

2.90 

3.29 

3.51 

3.67 

3.79 

3.88 

3.96 

4.03 

4.09 

9 

2.82 

3.19 

3.40 

3.55 

3.66 

3.75 

3.82 

3.89 

3.94 

10 

2.76 

3.11 

3.31 

3.45 

3.56 

3.64 

3.71 

3.78 

3.83 

11 

2.72 

3.06 

3.25 

3.38 

3.48 

3.56 

3.63 

3.69 

3.74 

12 

2.68 

3.01 

3.19 

3.32 

3.42 

3.50 

3.56 

3.62 

3.67 

13 

2.65 

2.97 

3.15 

3.27 

3.37 

3.44 

3.51 

3.56 

3.61 

14 

2.62 

2.94 

3.11 

3.23 

3.32 

3.40 

3.46 

3.51 

3.56 

15 

2.60 

2.91 

3.08 

3.20 

3.29 

3.36 

3.42 

3.47 

3.52 

16 

2.58 

2.88 

3.05 

3.17 

3.26 

3.33 

3.39 

3.44 

3.48 

17 

2.57 

2.86 

3.03 

3.14 

3.23 

3.30 

3.36 

3.41 

3.45 

18 

2.55 

2.84 

3.01 

3.12 

3.21 

3.27 

3.33 

3.38 

3.42 

19 

2.54 

2.83 

2.99 

3.10 

3.18 

3.25 

3.31 

3.36 

3.40 

20 

2.53 

2.81 

2.97 

3.08 

3.17 

3.23 

3.29 

3.34 

3.38 

24 

2.49 

2.77 

2.92 

3.03 

3.11 

3.17 

3.22 

3.27 

3.31 

30 

2.46 

2.72 

2.87 

2.97 

3.05 

3.11 

3.16 

3.21 

3.24 

40 

2.42 

2.68 

2.82 

2.92 

2.99 

3.05 

3.10 

3.14 

3.18 

60 

2.39 

2.64 

2.78 

2.87 

2.94 

3.00 

3.04 

3.08 

3.12 

120 

2.36 

2.60 

2.73 

2.82 

2.89 

2.94 

2.99 

3.03 

3.06 

00 

2.33 

2.56 

2.68 

2.77 

2.84 

2.89 

2.93 

2.97 

3.00 



Coefficients of Orthogonal Polynomials 3 



Adapted with permission from Biometrika Tables for Statisticians, Vol. 1, 3rd edition by E. S. Pearson and H. O. Hartley, Cambridge University Press, Cambridge, 1966. 
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XI. Random Numbers 3 


10480 15011 01536 
22368 46573 25595 
24130 48360 22527 
42167 93093 06243 
37570 39975 81837 
77921 06907 11008 
99562 72905 56420 
96301 91977 05463 
89579 14342 63661 
85475 36857 53342 
28918 69578 88231 
63553 40961 48235 
09429 93969 52636 
10365 61129 87529 
07119 97336 71048 
51085 12765 51821 
02368 21382 52404 
01011 54092 33362 
52162 53916 46369 
07056 97628 33787 
48663 91245 85828 
54164 58492 22421 
32639 32363 05597 
29334 27001 87637 
02488 33062 28834 
81525 72295 04839 
29676 20591 68086 
00742 57392 39064 
05366 04213 25669 
91921 26418 64117 
00582 04711 87917 
00725 69884 62797 
69011 65795 95876 
25976 57948 29888 
09763 83473 73577 
91567 42595 27958 
17955 56349 90999 
46503 18584 18845 
92157 89634 94824 
14577 62765 35605 


02011 87647 91646 
85393 30995 89198 
97265 76393 64809 
61680 07856 16376 
16656 06121 91782 
42751 27756 53498 
69994 98872 31016 
07972 18876 20922 
10281 17453 18103 
53988 53060 59533 
33276 70997 79936 
03427 49626 69445 
92737 88974 33488 
85689 48237 52267 
08178 77233 13976 
51259 77452 16308 
60268 89368 19885 
94904 31273 04146 
58586 23216 14513 
09998 42698 06691 
14346 09172 30168 
74103 47070 25306 
24200 13363 38005 
87308 58731 00256 
07351 19731 92420 
96423 24878 82651 
26432 46901 20849 
66432 84673 40027 
26422 44407 44048 
94305 26766 25940 
77341 42206 35126 
56170 86324 88072 
55293 18988 27354 
88604 67917 48708 
12908 30883 18317 
30134 04024 86385 
49127 20044 59931 
49618 02304 51038 
78171 84610 82834 
81263 39667 47358 


69179 14194 62590 
27982 53402 93965 
15179 24830 49340 
39440 53537 71341 
60468 81305 49684 
18602 70659 90655 
71194 18738 44013 
94595 56869 69014 
57740 84378 25331 
38867 62300 08158 
56865 05859 90106 
18663 72695 52180 
36320 17617 30015 
67689 93394 01511 
47564 81056 97735 
60756 92144 49442 
55322 44819 01188 
18594 29852 71585 
83149 98736 23495 
76988 13602 51851 
90229 04734 59193 
76468 26384 58151 
94342 28728 35806 
45834 15398 46557 
60952 61280 50001 
66566 14778 76797 
89768 81536 86645 
32832 61362 98947 
37937 63904 45766 
39972 22209 71500 
74087 99547 81817 
76222 36086 84637 
26575 08625 40801 
18912 82271 65424 
28290 35797 05998 
29880 99730 55536 
06115 20542 18059 
20655 58727 28168 
09922 25417 44137 
56873 56307 67607 


a Reproduced with permission from Probability and Statistics in Engineering and Management Science, 3rd 
edition, by W. W. Hines and D. C. Montgomery, Wiley, New York, 1990. 
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XII. Alias Relationships for 2 k p Fractional Factorial Designs with k =S 15 and n s 64 


Designs with 3 Factors 

(a) 2 3 1 ; 1/2 fraction of 

3 factors in 4 runs 

Design Generators 

C = AB 

Defining relation: I = ABC 

Aliases 

A = BC 

B = AC 

C = AB 

Resolution III 

Designs with 4 Factors 

(b) 2 4 1/2 fraction of 

4 factors in 8 runs 

Design Generators 

D - ABC 

Defining relation: I = ABCD 

Aliases 

A = BCD 

B = ACD 

C = ABD 

D = ABC 

AB = CD 

AC = BD 

AD = BC 

Resolution IV 

Designs with 5 Factors 

(c) 2 5 2 ; 1/4 fraction of 

5 factors in 8 runs 

Design Generators 

D = AB E = AC 

Defining relation: / = ABD = ACE = BCDE 

Aliases 

A = BD = CE 

B = AD = CDE 

C = AE = BDE 

D = AB = BCE 

E = AC = BCD 

BC = DE = ACD = ABE 

CD = BE = ABC = ADE 

Resolution III 

(d) 2 5-1 ; 1/2 fraction of 

5 factors in 16 runs 

Each m 

Design Generators 

E = ABCD 

Defining relation: I = ABCDE 

Aliases 

ain effect is aliased with a single 4-factor interaction. 
AB = CDE BD = ACE 

AC = BDE BE = ACD 

AD = BCE CD = ABE 

AE =BCD CE =ABD 

BC = ADE DE = ABC 

2 blocks of 8: AB = CDE 

Resolution V 
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Defining relatio 




XII. Alias Relationships for 2 k p Fractional Factorial Designs with k g 15 and n ^ 64 ( continued) 
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Defining relatio: 




667 





668 




II 

ii ii ii ii ii ii ii ii ii ii ii g fc, 5 

s 3 % $ s a g § § a a * 


C R 8'feg$g£SS&? 


OOUl;a;Qi ( ,OOS;H< 

QCHJ?i<3u39'o§g 

OaqQaQaaacQOq^^-^-^ 


669 




670 




671 



XII. Alias Relationships for 2 k p Fractional Factorial Designs with k =S 15 and n € 64 (co 


Ct, t£) Q Q 

I II II II II II II II II II II II II II II II II II II II II 


P8^8Sj*§8 
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XII. Alias Relationships for 2 k p Fractional Factorial Designs with k =S 15 and n =S 64 ( continued) 


i§ii§§§§§§ 

i H ii ii H ii H H ii H H i 
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Designs with 10 Factors 
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XII. Alias Relationships for 2 k (> Fractional Factorial Designs with k =£ 15 and n 64 ( continued ) 
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XII. Alias Relationships for 2 h p Fractional Factorial Designs with k =S 15 and n 64 ( continued) 
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XII. Alias Relationships for 2 k ~ p Fractional Factorial Designs with k =£ 15 and n =£ 64 (continued) 
Designs with 12 Factors 

(w) 2 12 s ; 1/256 fraction Resolution III 

of 12 factors in 16 runs 

Design Generators 

E = ABC F = ABD G = ACD H = BCD 
J = ABCD K — AB L= AC M = AD 
Aliases 

A = HJ = BK = CL = DM 
B = GJ = AK = EL = FM 


F = CJ = DK = HL = BM 
G = BJ = HK = DL = CM 
H = AJ = GK = FL = EM 
J = DE = CF = BG = AH 
K = AB = CE = DF = GH 
L = AC = BE = DG = FH 


Designs with 13 Factors 

(x) 2 13-9 ; 1/512 fraction of Resolution III 

13 factors in 16 runs 

Design Generators 

E = ABC F = ABD G = ACD H = BCD 
J = ABCD K = AB L = AC M = AD N = BC 
Aliases 


A = HJ = BK = CL = DM = EN 


C = FJ = EK = AL = GM = BN 


E = DJ = CK = BL = HM = AM 
F = CJ = DK = HL = BM = GN 


J = DE = CF = BG = AH = MN 
K = AB = CE = DF = GH = LN 
L = AC = BE = DG = FH = KN 
M = AD = BF = CG = EH = JN 
N = BC = AE = FG = DH = KL = JM 
AF = BD = EG = CH = JL = KM 
AG = EF = CD = BH = JK = LM 
2 blocks of 8: AF = BD = EG = CH = JL = KM 
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XII. Alias Relationships for 2 k ~ p Fractional Factorial Designs with 1c =S 15 and n =S 64 ( continued) 
Designs with 14 Factors 

(y) 2 14-10 ; 1/1024 fraction of Resolution III 

14 factors in 16 runs 

D esign Generators 

E = ABC F = ABD G = ACD H = BCD J = ABCD 
K = AB L= AC M = AD N = BC O = BD 
Aliases 

A = HJ = BK = CL = DM = EN = FO 
B = GJ = AK = EL = FM = CN = DO 
C = FJ = EK = AL = GM = BN = HO 
D = EJ = FK = GL = AM = HN = BO 
E = DJ = CK = BL = HM = AN = GO 
F = CJ — DK = HL = BM = GN = AO 
G = BJ = HK = DL = CM = FN = EO 
H = AJ = GK = FL = EM = DN = CO 
J = DE = CF = BG = AH = MN = LO 
K = AB = CE = DF = GH = LN = MO 
L = AC = BE = DG = FH = KN = JO 
M = AD = BF = CG = EH = JN = KO 
N = BC = AE = FG = DH = KL = JM 
O = BD = AF = EG = CH = JL = KM 


Designs with 15 Factors 


(z) 2 15_n ; 1/2048 fraction of 
15 factors in 16 runs 

Desig n Generat ors 

E = ABC F = ABD G = ACD H = BCD J = ABCD 
K = AB L= AC M = AD N = BC O = BD P = CD 
Aliases 

A = HJ = BK = CL = DM = EN = FO = GP 


Resolution III 


C = FJ = EK = AL = GM = BN = HO = DP 
D = EJ = FK = GL = AM = HN = BO = CP 
E = DJ = CK = BL = HM = AN = GO = FP 
F = CJ = DK = HL = BM — GN = AO = EP 
G = BJ = HK = DL = CM = FN = EO = AP 
H = AJ = GK = FL = EM = DN = CO = BP 
J = DE = CF = BG = AH = MN = LO = KP 
K = AB = CE = DF = GH = LN = MO = JP 
L = AC = BE = DG = FH = KN = JO = MP 


P = CD = EF = AG = BH = JK = LM = NO 
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2 2 factorial design, 5, 219 

2 3 factorial design, 6, 228 

2 4 factorial design, 7, 246 

2* factorial design, 7, 218, 242 

2*’ 1 designs, 304-317 

2*' 2 designs, 317 

2 t ' p designs, 326 

3* designs, 363, 372 

3*' 1 design, 379 

3*''’ designs, 382 

A-optimal design, 468 
aberration, 326 

additive models, 135, 145, 432 
additivity of the randomized block model, 136. See 
also testing for nonadditivity 
adequate prediction criterion, 104 
adjusted R 2 statistic, 104, 411. See also R 2 
adjusted treatment effects, 157, 161 
adjusted treatment means in covariance analysis, 607 
adjusted treatment totals, 157 
agricultural heritage, 17, 18 
alphabetic design optimality, 468-469 
alias, 304 

alternate fraction, 305, 318 
alternative hypothesis, 34 
analysis algorithm for the 2* design, 242 
analysis of covariance, 15,126, 604 
analysis of variance, 60, 63, 66 
analytical determination of a transformation, 590 
antagonistic blending in mixtures, 476 
approximate confidence intervals on variance 
components, 543 

approximate F-tests, 535, 539, 540 
axial runs, 274, 365, 478 

balanced design, 154, 558, 600 
balanced incomplete block designs, 154 
Bartlett’s test for equality of variances, 81 
basic design, 307, 317, 381 
best-guess approach to experimentation, 3 
between treatment variability, 66 
binary blends, 472 

blocking, 12, 13, 15, 50, 126, 127, 130, 207, 209, 287, 
289, 296, 298, 315, 331, 373, 462, 574, 576, 
579, 580, 604 

blocking in response surface designs, 462-466 
Bonferroni method of simultaneous confidence 
intervals, 75 

boundary point designs, 478 
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box plot, 23, 62 
Box-Behnken design, 458 
Box-Cox method, 590 


canonical analysis of a second-order response surface 
model, 440 

center points, 271, 365, 431, 458, 461 
central composite design, 11, 274, 275, 365, 441, 456 
central limit theorem, 30 
central tendency, 22 

change in mean effect in EVOP, 484, 488 
characterization experiment, 8. See also screening 
experiments 
check blends, 478 

checking assumptions, 38, 76-86, 135, 185, 224, 242, 
251, 258, 261, 416 
chi-square distribution, 30 
Cochran’s theorem, 69 
coded variables, 172, 223, 431 
coding the data in an analysis of variance, 72 
coefficient of variation, 104 
combined array design, 492, 494 
combining fractions to estimate effects, 303, 306, 315, 
339, 347, 348 

combining interblock and intrablock information, 162 
comparative experiment, 21 
comparing treatments with a control, 103 
comparison of all contrasts, 95 
complete blocks, 127 
complete confounding, 299 
completely randomized experimental design, 33, 64, 
176, 207 

complimentary fraction. See alternate fraction 
components of interaction, 204, 366 
components of variance, 65, 511, 512 
computations in the analysis of variance, 70, 72 
computer-generated designs, 409, 466, 479 
concomitant variable, 604 

conditional inference chart for unreplicated factorials, 
253 

confidence coefficient, 42 
confidence interval, 33, 42 
confidence interval on mean response in regression 
model, 416 

confidence interval on treatment mean, 74, 75, 100 
confidence intervals on variance components, 516, 491 
exact intervals, 516, 543 
maximum likelihood procedures, 552 
modified large-sample method, 545 
Satterthwaite-type approximate intervals, 543 







INDEX 


681 


confidence limits, 42 
confirmation tests, 17, 317 
confounding in the 2* design, 288 
four blocks, 296 
more than four blocks, 297 
two blocks, 289 

confounding in the 3* design, 373 
more than nine blocks, 378 
nine blocks, 377 
three blocks, 373 

connection between analysis of variance and 
regression, 112 

construction of fractional factorials, 307, 318, 337, 
379-381 

continuous probability distribution, 24 

continuous random variable, 22 

contour plot, 10, 204 

contrasts, 90, 93, 221, 231, 290 

control treatment, 103 

controllable factors, 2, 14, 15, 489, 493 

Cook’s D, 420 

corrected sum of squares, 28 

covariance, 26 

covariance matrix, 397 

covariate, 604 

criterion for selection of a design, 455 

critical region, 34 

crossed array design, 491 

crossed factor, 170. See also factorial experiments 

crossover designs, 150 

cube plot, 242, 262 

cubic effects, 88 

cuboidal designs, 450 

curvature, 174, 272, 432 

cycle in EVOP, 484 

D-optimal design, 468 
data snooping, 94 

data transformation, 40, 81, 84-86, 257 
dealiasing interactions, 306, 315, 339, 348, 407. See 
also fold-over 

defining contrast, 290, 296, 373 
defining relation for a fractional factorial, 304, 318, 379 
degrees of freedom, 29, 30, 31 
dependent variable, 392 
design augementation, 409. See also dealiasing 
interactions and fold-over of fractional 
factorials 

design criteria, 455 

design generator. See generator for fractional factorial 

design matrix, 228 

design projection, 247, 303, 310 

design resolution, 308 

designs for a first-order model, 455 

desirability functions, 451—454 

direction of steepest ascent. See steepest ascent 

discrete probability distribution, 24 

discrete random variable, 22 

dispersion effects, 110, 241, 260, 264, 323 

dot diagram, 21, 22 


Duncan’s multiple range test, 100 
Dunnett’s test, 103 

duplicate measurements on the response, 265 

effect estimates, 220 

magnitude and direction, 221 
effects model, 64, 128, 145, 177 
eigenvalues, 440 
eigenvectors, 446 
empirical model, 17, 87, 392 
empirical selection of a transformation, 81, 84 
equiradial designs, 461 
error, 22 

estimable functions, 113, 114, 188, 189 
estimate, 27 

estimating missing values, 139, 148, 602 
estimation of model parameters in the analysis of 
variance, 74, 112, 185 
estimation of variance components, 513 
analysis of variance method, 513, 518, 524 
maximum likelihood, 547 
restricted maximum likelihood method, 549 
estimator, 27 

evolutionary operation (EVOP), 484-488 
exchange algorithm, 469 
expected mean squares, 68, 179 
rules for, 531 

expected value operator, 25 
expected value, 25 
experiment-wise error rate, 75 
experimental design and product design, 8,11 
experimental error, 11, 12, 34, 64 
experimental unit, 13, 64, 126 
experimenter, 2 

exponential family of distributions, 594 
extra sums of squares, 412 

face-centered cube, 459 
factor effect, 5, 6, 220 
factor levels, 14, 21, 60, 245 
factorial experiment, 4, 170, 218 
in blocks, 207, 287 
in split-plots, 578 
factors, 1, 2, 3, 5, 14, 21, 60 
/■-distribution, 32 
first-order model, 226, 427, 455 
fixed effects model, 65, 511 
fold-over of fractional factorials, 339, 340, 347, 348. 

See also dealiasing of interactions 
follow-up experiments. See confirmation tests 
fractional factorial experiment, 7, 303, 379 
fractional factorial family, 306 
full fold-over, 340 
full model, 115, 116, 142, 413 

G-optimal design, 468 

gauge capability studies, 519, 524 

general regression significance test, 114, 141, 626 

generalized interaction, 296, 299, 377 

generalized linear model, 594 
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generating relation, 318 
generator for fractional factorial, 304, 318 
geometric notation for factorials, 228 
Graeco-Latin square designs, 151, 383 
graphical comparison of means, 89 
guidelines for designing experiments, 13 

half-normal plot of effects, 253 

hat matrix, 417 

held-constant factors, 14 

hexagon design, 461 

hidden replication, 247, 619 

hierarchical design. See nested designs 

hierarchical model, 203, 286 

hierarchy principle in model-building, 203, 286 

histogram, 23 

hybrid designs, 461 

hypersquares, 154 

hypothesis testing, 21, 33, 409 

/-component of interaction, 366 
identity element, 231 

importance of nonstatistical knowledge, 19 
inaccurate levels in design factors, 405 
inequality of variance assumption, 80 
incomplete blocks, 154 

independence assumption in the t-test and the analysis 
of variance, 38-40, 79 
independent random variables, 26 
independent variable, 392 
indicator variables, 203 

industrial experiments versus agricultural experiments, 
18 

inference space of an experiment, 511 

influence and leverage, 419, 420 

inner array, 491 

interaction, 4, 137, 171, 174 

interaction between treatments and blocks, 137 

interblock analysis, 161 

intrablock analysis, 161 

intrablock error, 163 

irregular fraction, 346 

iterative nature of experimentation, 17, 20. See also 
sequential experimentation 

/-component of interaction, 366 

Kruskal-Wallis test, 116 

lack of fit, 239, 272,421 
Latin square design, 144, 148, 209, 365 
least significance difference (LSD) method of 
comparing means, 99 

least squares estimation of parameters, 88, 112, 141, 
159, 186, 394 

least squares normal equations, 112, 141, 159, 
186-187, 395 

Lenth’s method for unreplicated designs, 254 
Levene’s test for equality of variance, 82 


leverage points, 420. See also influence and leverage 
linear blending, 476 

linear model, 64. See also statistical model 

linear predictor, 594 

link function, 595 

location effects. 111, 260, 323 

location, as in central tendency, 22 

main effect, 5, 170, 220 

maximum likelihood estimation, 547, 549, 595 

mean, 12, 25 

mean squares, 68 

means model, 64, 128, 177 

method of least squares. See least squares estimation of 
parameters 

method of moments, 547 

method of steepest ascent. See steepest ascent 

method of unweighted means, 603 

minimal designs of resolution IV, 347 

minimum abberation design, 326 

minimum variance estimator, 27 

missing observations in a 2 k factorial, 404 

missing observations in a randomized block design, 

139 

missing observations in a Latin square design, 148 
mixed factor levels in a factorial design, 383 
mixed model, 522, 559, 569 
alternate forms, 526 
estimation of variance components, 524 
restricted model, 523 
standard error of fixed effect mean, 524 
unrestricted model, 526 
mixture experiments, 472 
mixture models, 476 
model hierarchy, 203, 286 
modified large-sample method, 545 
modified Levene test. See Levene’s test 
moment estimator, 547 
m-stage nested design, 566 
multiple comparisons, 88, 133, 182 
multiple linear regression model, 393 
multiple responses, 448 

natural factor levels, 224, 226, 431 

nested designs, 557-568 

nested-factorial design, 569 

Newman-Keuls test, 102 

no-interaction factorial model, 190 

noise factors. See uncontrollable factors 

noise reduction property of blocking, 51, 132-133 

noncentral F random variable, 107 

nonconstant variance, 38, 44, 79, 80 

nongeometric designs, 343 

nonlinear programming, 451 

nonparametric approach to the analysis of variance, 

116 

normal distribution, 29 

normal equations. See least squares normal equations 
normal probability plot of effects, 246, 264 
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normal probability plots, 38, 72 
normality assumption in /-tests and the analysis of 
variance, 38, 77 
nuisance variable, 13, 126 
null hypothesis, 34 

O.C. curve. See operating characteristic curve 

Ockham’s razor, 310 

one-at-a-time confidence intervals, 75 

one observation per cell, 191 

one-factor-at-a-time experiments, 4 

one-half fraction, 7, 304 

one-sided alternative hypothesis, 35 

one-way analysis of variance, 64 

operating characteristic curve, 40, 107, 139, 189, 529 

optimal designs, 468 

optimization of a process, 9, 427 

optimum conditions, 427,429, 430,436 

orthogonal contrast, 93, 221, 231 

orthogonal designs, 231, 404, 455 

orthogonal effects, 221, 231 

orthogonal Latin squares, 151, 365 

orthogonality, 93, 221, 231 

outer array, 491 

outliers, 78, 417 

overall mean, 64 

pair-wise comparison of means, 96-104 

paired comparison design, 47, 50 

paired r-test, 49 

partial aliasing, 344, 380, 383 

partial confounding, 299 

partial E-test, 413 

partial fold-over, 349 

partial regression coefficients, 393 

path of steepest ascent. See steepest ascent 

pentagon design, 461 

phase in EVOP, 484 

Plackett-Burman designs, 343-347 

plotting residuals, 76-86 

point of maximum response, 436, 437 

point of minimum response, 436, 438 

pooled /-test. See two-sample /-test 

pooling mean squares to estimate error, 535 

power, 34 

power family transformations, 84, 591 

practical significance versus statistical significance, 19 

prediction interval, 416 

prediction variance as a design criterion, 455, 457, 46! 

pre-experimental planning, 14, 16 

PRESS residuals, 418 

PRESS, 104,418 

principal block, 291, 297, 373 

principal fraction, 305 

probability distribution, 23 

process robustness studies, 1, 127, 176, 488 

projection of factorials, 246, 303, 307, 331 

projectivity, 345 

propagation of error, 495 


properties of least squares estimators, 397 

proportional data in an unbalanced factorial, 600 

pseudocomponents, 479 

pseudo /Mests. See approximate E-tests 

pure blends, 474 

pure error, 239 

P-values, 37 

quadratic effects, 88, 204, 432 

qualitative factors, 86, 201, 275, 368 

quantitative factors, 86, 171, 201, 272, 363, 368, 511 

/{-student, 419 

R 2 , 104, 411. See also adjusted R 2 
R 2 for prediction, 104, 419 
random effects model, 65, 511, 512, 557 
random sampling, 26 
random variable, 22 

randomization restriction, 130, 145, 207, 209 
randomization tests, 40, 73 
randomization, 12, 13, 61, 126, 148 
randomized block design, 50, 126, 207 
complete blocks, 127 
incomplete blocks, 154 

randomized experiment. See completely randomized 
experiment 

rank transformation, 117, 118 
reduced model, 116, 143, 413, 520 
reference distribution, 35 

regression model, 87, 172, 177, 201, 204, 223, 235, 
364, 392, 604 
regressor variable, 392 
rejection region. See critical region 
relative efficiency of factorials, 174 
relative information for confounded effects, 300 
repeatability, 519 

repeated measures experiment, 624 
replication, 5, 12, 16, 60, 247 
replication of Latin squares, 148 
reproducibility, 519 

residual analysis, 76, 79, 185, 224, 400, 416, 563 
residual sum of squares, 397 
residual treatment effects, 150 
resolution in designs, 306, 337 
resolution IV designs, 306, 347 
resolution of a fractional factorial. See design 
resolution 

resolution V designs, 307, 347 
response model for robust design, 492 
response surface designs, 11, 364, 429 
response surface methodology (RSM), 11, 427 
response surface, 10, 173, 201, 204, 225, 235, 364, 
393, 427 

response variable, 1, 2, 14, 15, 392 
restricted form of the mixed model, 523, 531, 539, 
559, 569 

restriction on randomization. See randomization 
restriction; see also blocking 
ridge systems, 447 
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rising ridge, 448 

robust design, 488. See also process robustness studies 

robust process, 1. See also process robustness studies 

rotatable design, 457 

^-Student, 419 

rules for expectation, 26 

rules for expected mean squares, 531 

saddle point, 436, 439 
sample mean, 12, 26, 27 

sample size determination, 40, 107-110, 139, 189, 529 
sample standard deviation, 27 
sample variance, 26, 27 
sampling distribution, 29 

Satterthwaite’s method. See approximate F-tests 

saturated designs, 337 

scaled residuals, 417 

scatter diagram, 662 

Scheffe’s test, 95 

screening experiment, 9, 15, 218, 303 
second-order designs, 456 
second-order response surface model, 429 
sequential experimentation, 10, 17,18, 20, 303, 315, 
365, 429, 456 
signal-to-noise ratio, 491 
significance level, 34, 37 
significance testing. See hypothesis testing 
simplex centroid designs for mixtures, 475 
simplex design for the first-order model, 456 
simplex lattice designs for mixtures, 474 
simultaneous confidence interval, 75 
single replicate, 191, 244. See also one observation per 
cell 

single-factor analysis of variance, 64 
simultaneous confidence intervals, 75, 95 
skewed distribution, 30 
slope of the response surface, 493 
small central composite designs, 461 
sparsity of effects principle, 245, 303 
spherical designs, 457 
split-plot design, 557, 573, 578 
split-split plot designs, 580 
staggered nested designs, 566 
standard error of a regression coefficient, 239, 412 
standard error of effects in a 2*, 241 
standard Latin square, 148 
standard normal distribution, 29 
standard order, 223, 228, 242 
standardized contrast, 92 
standardized residuals, 78, 417 
stationary point, 436, 440 
stationary ridge, 448 
statistical error, 22, 64, 412 
statistical model, 34, 48, 64, 87, 128, 155, 177, 190, 
191, 194, 207, 210, 393, 427, 429, 436, 476, 
492, 493, 512, 517, 522, 526, 558, 567, 569, 
575, 576, 578, 581, 584, 594, 605, 624 
steepest ascent, 227, 430, 435 
strategy of experimentation, 1, 3 
strength of a transformation, 84 


strip-split-plot design, 583 
studentized range statistic, 97, 102 
studentized residuals, 418 
subplot error, 575, 579, 584 
subplots, 574, 579 
subsampling, 577 
symmetric designs, 155 
synergistic blending in mixtures, 476 

(-distribution, 31 

team approach to designing experiments, 14 
test for interaction (nonadditivity), 192 
testing for significance of regression, 409 
three-stage nested design, 566 
total effect of a factor, 221 

transformation to correct violations of assumptions, 40, 
81, 84-86, 257, 590 
transmission of error, 492, 495 
treatment effect, 64 
treatments orthogonal to blocks, 139 
treatments, 21, 60 
Tukey’s test, 96 

two-factor analysis of variance, 177 
two-sample (-test, 35 
unequal variances, 44 
two-sided alternative hypothesis, 34 
two-stage nested design, 557 
Type III sums of squares, 620 

unbalanced factorial design, 600 
unbalanced data in the analysis of variance, 75, 600 
unbalanced design, 75, 144, 600 
unbiased estimator, 27 
uncontrollable factors, 2, 15, 489 
unplanned experiments, 392 
unreplicated design. See single replicate 
unrestricted form of the mixed model, 526, 540, 573 
unweighted means methods in analysis of variance, 
603 

V-optimal design, 468 
variability, 22, 323 
variance components, 512, 518, 565 
variance operator, 25 

variance stabilizing transformations, 81, 84-86, 257 
variance, 25 

IL-component of interaction, 372 
weighted least squares, 595 
weighted squares of means method, 603 
whole plot error, 575, 579, 584 
whole plots, 574, 579, 583 
within treatment variability, 66 
words in defining relation, 318 

X-component of interaction, 372 

Yates’ order. See standard order 
F-component of interaction, 372 
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